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FOREWORD 


One  of  the  major  problems  facing  the  decision  makers  and  de¬ 
signers  of  knowledge-based  expert  systems  is  to  develop  a  meaningful 
model  of  uncertainty  associated  with  complex  and  dif f icult-tu- 
understand  subjective  and  objective  real-world  phenomena.  Stimu¬ 
lated  by  the  research  in  artificial  intelligence  and  related  areas, 
the  field  of  knowledge-based  expert  systems  blossomed  in  the  mid- 
1970' s  and  has  enjoyed  more  than  a  decade  of  vigorous  growth. 
Contributions  to  the  growth  have  come  from  many  theories  of  uncer¬ 
tainty  including  Bayes'  Statistics,  Zadeh's  Possibility  Theory  and 
Belief  Functions,  etc.,  and  have  embodied  many  disciplines  of 
mathematics  and  engineering  such  as  statistics,  mathematical  logic, 
measure  theory,  communication  theory,  operations  research,  systems 
theory,  psychology,  linguistics  and  computer  science.  Readers  who 
sample  the  literature  in  the  field  of  knowledge-based  systems  will 
soon  appreciate  the  color  and  vigor  that  make  up  the  discipline. 

This  diversity  in  model-making,  though  healthy  in  developing 
phases,  has  posed  some  serious  problems  before  the  students,  re¬ 
searchers,  as  well  as  users  of  the  models.  No  single  volume  on 
model-making  embraces  all  of  these  approaches. 

This  Research  Monograph  authored  by  Irwin  R.  Goodman  and  Hung 
T.  Nguyen  is  a  first  attempt  to  present  a  general  framework  for 
the  manipulation  and  explanation  of  uncertainty  in  the  design  of 
knowledge-based  expert  systems.  It  provides  mathematical  foun¬ 
dations  and  gives  extension  and  applications  of  various  theories  of 
uncertainty,  including  Bayes'  Statistics,  Zadeh's  Possibility  Theory 
and  Belief  Functions. 

Also,  this  monograph  addresses  topics  such  as  knowledge  re¬ 
presentation,  inference  rules,  and  combination  of  evidence.  The 
general  framework  is  based  upon  the  theory  of  formal  languages  and 
semantic  evaluations  from  different  systems  of  mathematical  logic. 
The  underlying  processes  of  reasoning  lead  to  decision  analyses  in 
various  contexts  of  knowledge-based  system  theory.  A  general  dis¬ 
cussion  is  presented  on  topics  such  as  generalized  set  theory  from  a 
viewpoint  of  multi-valued  logic  and  its  connection  with  Category 
Theory.  In  presenting  these  topics,  the  authors  do  an  impressive 
job  in  introducing  some  basic  concepts  which  provide  a  rigorous 
foundation  for  uncertainty  analysis.  This  monograph  also  gives  a 
survey  of  the  s cate-of-the-ar t  of  research  In  the  areas  of  fuzzy 
sets  and  Zadeh’s  Possibility  Theory. 

The  authors'  exposition  in  this  present  monograph  Is  an  out¬ 
growth  of  their  research  involvement  In  the  field  over  the  last 
decade.  Naturally,  this  serves  to  present  a  unified  approach  to 
uncertainty  models  and  its  applications  in  areas  such  as  medical 
diagnosis,  fault  analysis,  geological  prospecting  and  other  disci¬ 
plines  of  expert  systems  and  decision  analysis.  The  authors  deserve 
to  be  complimented  for  presenting  such  a  coherent  account  of  such  a 
diversified  field.  It  Is  a  seminal  work.  This  is  a  first  major 
contribution  which  gives  both  theory  and  applications  of  uncertainty 
models  and  poses  many  open  research  problems  and  unresolved  issues 
which  may  be  of  great  Interest  to  our  future  researchers. 

Madan  M.  Gupta 
University  of  Saskatchewan 
Saskatoon 
October,  1984 
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PREFACE 


This  monograph  is  an  attempt  at  presenting  a  unified  treatment 
of  uncertainty  modeling  which  has  been  developed  by  many  researchers 
over  the  past  two  decades.  Uncertainty  modeling  has  been  widely 
used  in  systems  analysis,  as  ,  e.g.,  in  expert  systems  treating  med¬ 
ical  diagnoses,  fault  determinations  and  geological  prospecting. 

The  general  inferential  problem  can  be  described  as  follows: 
Consider  some  real-world  system  of  interest  such  as  the  disease 
condition  of  a  given  patient,  the  possible  areas  where  oil  may  be 
found,  and  the  relationships  of  reported  targets  in  a  military  sce¬ 
nario.  By  a  system,  in  general,  we  mean  a  collection  of  possible 
real-world  situations  or  "states".  A  process  is  a  system  where  the 
collection  of  possible  states  may  also  depend  on  time  or  other  in¬ 
dices.  Associated  with  any  system  is  a  collection  of  attributes 
used  to  describe  the  system.  These  attributes  could  be,  e.g.,  posi¬ 
tion,  temperature,  sea  state,  blood  pressure,  degree  of  comfort, 
degree  of  happiness  or  any  other  description,  linguistic  or  numer¬ 
ical,  that  could  be  applied  to  a  system.  At  a  given  time  t  ,  often 
it  is  not  known  which  state  or  states  of  a  system  are  present.  This 
is  called  the  uncertainty  of  the  system  at  time  t  ,  This  leads  to 
the  establishment  of  an  experiment  in  order  to  gather  data  or  evi¬ 
dence  concerning  which  states  of  the  system  are  present  at  time  t 
and  to  what  degree.  The  design  of  the  experiment  will  of  course 
depend  on  the  type  of  system  considered.  Thus,  a  medical  situation 
requires  asking  a  patient  questions  resulting  in  some  linguistic 
(or  natural  language)  data,  while  the  problem  of  investigating  the 
safety  level  of  a  particular  street  corner  typically  involves  a 
statistical  experiment  with  numerical  data.  The  outcomes  of  the  ex¬ 
periment  may  be  sorted  according  to  possible  attributes  associated 
with  the  system.  Each  attribute  has  a  domain  of  possible  values 
(numerical  or  linguistic,  etc.,  and  possibly  time  dependent)  some  of 
which  are  observed  as  data.  Thus,  in  the  medical  situation  mentioned 
above,  some  attributes  could  be:  Patient's  degree  of  comfort  which 
could  include  the  possible  domain  of  outcomes  "feeling  badly," 
"can't  breathe,"  "feel  tired  and  thirsty,"  and  temperature,  which 
could  range  from  90  to  100  degrees  fahrenheit.  On  the  other  hand, 
for  the  safety  level  problem,  typical  attributes  might  include:  The 
number  of  accidents  at  that  corner  over  a  three  year  period,  with 
domain  being  integer-valued  and  the  traffic  flow  rate  described  by 
the  number  of  vehicles  passing  the  area  per  unit  time. 

Two  types  of  uncertainties  arise:  The  first  involves  only 
uncertainties  in  che  outcomes  of  the  experiment,  the  second  in¬ 
volves,  in  addition,  uncertainties  in  the  meaning  of  the  data. 
Examples  of  the  first  situation  occur  in  any  well-defined  random 
experiment  such  as  tossing  a  coin,  observing  queues,  and  recording 
observed  signals.  Examples  of  the  second  type  include  experiments 
involving  linguistic  outcomes,  where  for  information  processing 
purposes,  it  is  necessary  to  exercise  greater  care  in  the  modeling 
of  meaning  representation. 

Depending  on  the  nature  of  uncertainties  present  in  a  system 
and  the  data,  different  types  of  uncertainty  models  are  appropriate. 
An  uncertainty  model  may  consist  of  a  class  of  chosen  measures  or 
distributions  together  with  a  logical  system  or  calculus  of  opera¬ 
tors  on  these  measures.  On  the  other  hand,  uncertainty  models  may 
be  completely  heuristic,  reflecting,  e.g.,  the  common  knowledge  of  a 
group  of  individuals  or  "experts".  Ideally,  uncertainty  inode's  can 
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be  both  rigorous  and  heuristic  in  the  sense  that  the  mathematical 
aspect  of  the  model  contains  all  necessary  expert  knowledge  appro¬ 
priately  encoded.  Three  well-known  examples  of  uncertainty  models 
are  the  probabilistic  model,  Zadeh's  possibilistic  model,  and  the 
inexact  reasoning  model  in  medicine  using  o'rtainty  factors,  as 
developed  by  Shortliffe,  Buchanan  and  others. 

Recently,  expert  systems  have  come  to  the  forefront  in  the 
modeling  of  problems.  The  most  important  problems  in  designing 
expert  systems  are  those  of  knowledge  representation  and  combination 
of  evidence.  These  problems  are  of  a  semantic/mathematical  nature 
due  to  the  subjective  character  of  natural  language.  Zadeh's  pos¬ 
sibilistic  model  (also  called  PRUF)  and  related  generalizations  are 
attempts  to  deal  with  these  important  issues.  Once  a  class  of  un¬ 
certainty  measures  has  been  chosen  for  analyzing  a  system  at  hand, 
the  reasoning  process  can  be  implemented  within  an  appropriate  cal¬ 
culus  . 

Our  general  approach  is  through  the  concept  of  a  "dispersion" 
or  generalized  set  membership  function.  By  a  dispersion,  we  mean  a 
distribution  in  the  general  sense,  not  just  the  probabilistic  inter¬ 
pretation,  nor  necessarily  the  possibilistic  meaning,  i.e.,  fuzzy 
set  membership  function  or  possibility  distribution.  A  dispersion 
<$>A  of  an  attribute  A  having  domain  D(A)  is  usually  defined  as  a 

mapping  $  :  X  -♦  [0,1]  ,  where  X  2  D(A)  is  some  (base)  space 

representing  the  universe  of  all  possible  elements  (or  objects  or 
values)  of  interest  relative  to  A  which  may  be  in  numerical, 
linguistic  or  other  forms.  Then  each  element  x  e  X  is  assigned 
the  value  $a;::)  representing  the  degree  of  truth  or  uncertainty 

associated  with  x  with  respect  to  possessing  attribute  A  or 
being  "in"  A  considered  as  a  general  (or  fuzzy)  subset  of  X  . 

One  example  of  a  dispersion  is  derived  from  probability  theory 
where  (n,  a,  Pr)  is  a  probability  space.  In  this  case,  X  =  a 
(not  n  !)  ,  A  is  an  attribute  whose  domain  of  values  D ( A )  con¬ 
sists  of  the  subsets  of  rt  constituting  the  o-algebra  a  such  that 
<}>A  =  Pr  .  Note  that  the  sum  of  <1>A  over  a  in  general  exceeds 

unity  or  is  divergent.  (Of  course,  the  sum  of  $A  restricted  to 

any  disjoint  collection  of  sets  from  a  will  not  exceed  unity, 
compatible  with  the  usual  requirement  of  Pr  being  a  probability 
measure.)  Another  example  Is  furnished  by  the  attribute  "tall"  with 
X  representing  the  range  of  heights  of  men  in  feet.  Then  <f>A  is 

non-decreasing . 

Sometimes  dispersions  may  represent  more  general  concepts  of 
distributions  with  values  lying  in  a  lattice  or  a  Heyting  algebra, 
or  dispersions  may  be  understood  in  the  sense  of  Manes  as  used  in 
his  "fuzzy  theories".  The  modeling  of  uncertainty  will  be  carried 
out  within  a  rigorous  framework  called  a  general  logical  system. 

By  a  general  logical  system  we  usually  mean:  (1)  a  triple  of 
logical  operators  (C.  *,  f)  representing  the  connectives  "not", 
"and",  "or",  and  (ii)  a  dispersion  mapping  9  :  Set  -*  Set  ,  where 
Set  is  a  collection  of  sets  of  Interest  and  for  any  X  in  Set  , 
sr(X)  is  a  class  of  dispersions  over  X  .  In  practice,  due  to  the 
complexity  of  the  problem  considered,  a  hybrid  logical  system 
consisting  of  a  collection  of  logical  operators  may  be  more  appro¬ 
priate.  A  logical  system  may  or  may  not  be  truth-functional,  i.e., 
the  operators  may  or  may  not  be  in  functional  composition  form  with 
respect  to  dispersions.  For  example,  a  generalized  set  system 
(extending  Zadeh's  fuzzy  set  theory)  consists  of  the  triple  of  oper- 
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ators  4>not  ,  $and  ,  <pgr  ,  with  $not  :  [0,1]  -  [0,1]  a  negation, 

2 

4>&  and  :  [0,1]  -*  [0,1]  being  a  t-norm  and  a  t-conorm,  respec¬ 

tively,  with  9  :  Set  -♦  Set  ,  usually  chosen  so  that  Set  is  some 
subcollection  of  well-formed  sets  and  for  any  set  X  ,  9  (X)  =  *(X)  , 
the  class  of  all  attributes  A  (or  fuzzy  subsets)  of  X  .  Each  A 
may  be  identified  with  its  membership  function  <$>A  :  X  -»  [0,1]  and 

ft 

more  generally  9  (X)  may  be  identified  with  [0,1]  .  For  this 

system,  x  and  t  are  defined  by: 

*AxB(X'y)  =  WX)  '  VY)) 

<?WX<Y)  »  *0r(Vx)  ’  VY)) 

for  all  A  e  9 (X)  ,  B  e  s(Y)  ,  x  e  X  ,  y  e  Y  and  X  ,  Y  arbitrary 

sets  . 

Thus,  the  above  important  system  which  extends  classical  two¬ 
valued  logic  is  a  truth-functional  one.  On  the  other  hand,  consider 
the  following  system: 

Let  Set  =  {^1)  , 

J  (af)  =  |  Ap^  (R,  3^,  Pr)  is  any  probability  spacej  s  9(9,^)  , 

where  a.  is  the  Borel  o-algebra  of  R  and  where  <{>.  *  Pr  .  Let 

APr 

be  any  infinite  copula.  (This  condition  is  equivalent  to  forc¬ 
ing  the  generator  of  the  canonical  representation  of  <J>&  to  be  any 
fixed  completely  monotone  function.)  Let  $or  be  a  modular  (or 
equivalently  a  valuation  or  Frankian)  cocopula  with  respect  to  , 

i.e.,  <}>or(x,y)  =  x  +  y  -  <J>&(x,y)  .  Now  let  A  . Apr  e  y(»  )  , 

1  n 

n  £  2  ,  be  arbitrary.  Also,  let  a  «  r  be  arbitrary  and  define 

Ia  =  (-®,a)  «  and  denote  F^(a)  =  Pr.j((Ia)),  j  =  i . n  . 

For  these  Apr  ,...,Apr  e  9  (a^  )  and  a1,...,an  e  R  ,  by 
ri  rn  n 

Sklar 1 s  theorem  we  have  (see  also  sections  2.3.6  and  2.3.9): 


F1 . n(al . an>  =  VW'’ 


•'Van>> 


=  VPrl(V . Prn(Ia  >>  ' 

1  n 


which  yields  F  a  joint  probability  distribution  function 

x  f  /  n 

for  marginals  Fi'-*',Fn  ,  determining  uniquely  joint  probability 
measure  Pr  for  probability  space 

i ;  •  *  •  t  n 

(Rn,  an,  Pr  )  .  In  particular,  note  that  we  may  rewrite 

i  »  •  •  •  *  n 


1 ,  .  .  .  ,  n 


xii 


(I. 


{ I  )) 


Pr. 


Pr 


,n(al 


'an> 


which  is  a  truth-functional  form  for  the  system  relative  to  all 
"points”  I  e  a  j  =  l,...,n  .  If  =  prod  (ordinary  product), 

the  above  equation  may  be  extended  to  arbitrary  "points" 

j  =  l,...,n  ,  and  thus  the  logical  system  remains  truth- 


Cj  €  *l' 
functional 


However,  in  general,  if 


<t>&  *  Prod 


there  are  many 


cj  e  v  j  = 

1  ,  .  • 

.  . n  ,  such  that 

*A 

Prl,.. 

(C 

.  ,  n 

1 . Cn>  *  V*A_  (C1>'- 

1 

•••♦a  <cn))  ' 

APr  n 

n 

i  .  e  .  , 

Prl . 

n(Clx 

...xcn)  *  VPrl(Cl> . Prn(cn)}  ' 

because  of 
Indeed,  for 

the 

n  = 

additivity  constraint  of  probability  measures 
2  ,  let  C 2  =  CC1  ,  for  C1  e  such  that 

J  '  Pri.2(Cl 

*  C1 

)  -  Prli2(C2  x  ca>  s  |  . 

Prl,2<C2  X  Cl>  =  Prl,2(Cl  X  C2)  -  1  -  Pr 


1,2(C1  X  Cl) 


Thus,  Pr1(C1) 


Pr2(Ci: 


=  Prl(C2)  =  Pr2(C2)  =  ±  . 


Yet,  Prlf2(Cl  x  C2; 


=  *  -  Pr1<2(C1  x  Ci)  *  Pr1(2(Cl  x  ca, 


Thus,  Pr1  2(ci  x  C2^  is  not  3  °f  Pr^C^) 


pr2(C2) 


for 


all 


C1'C2  €  *1 


Again,  by  use  of  Sklar’s  theorem,  it  can  be  show,: 


that  all  joint  probability  measures  are  obtainable  through  use  of 
some  copula. 

Associated  with  any  general  logical  system  is  a  class  of  propo¬ 
sitions  many  of  which  are  of  the  form  <x  is  in  A>  for  any  x  >=  X 
and  any  attribute  A  with  domain  dom(A)  s  X  ,  any  set  of  interest. 
In  addition,  a  class  of  compound  propositions  may  be  constructed  as 
strings  of  propositions  employing  the  ^nsic  connectives.  Moreover, 
both  conditional  and  quantified  propositions  may  be  defined. 

All  of  the  above  may  be  considered  as  semantic  or  truth  evalu¬ 
ations  of  appropriate  generalized  set  membership  predicates  within  a 
formal  language. 

Models  of  uncertainty  can  be  used,  in  turn,  in  designing  rule- 
based  systems,  leading  to  decision  procedures  such  as  estimation, 
hypotheses  testing  or  confidence  region  construction. 


The  book  deals  with  mathematical  foundations,  applications  and 
extensions  of  previous  attempts  in  treating  uncertainty  for  know¬ 
ledge  representation,  relating  these  whenever  possible  to  semantic 
evaluations  of  well-formed  formulas  in  the  form  of  set  membership 
relations,  within  a  formal  language  structure.  These  approaches 
include:  Classical  objective  and  subjective  probability  theory: 
Zadeh’s  concepts  of  fuzziness  and  possibility  and  its  generaliza- 
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tions;  Dempster  and  Shafer's  upper  and  lower  probabilities  and 
belief,  doubt  and  other  uncertainty  reasures;  and  other  schools  of 
thought  such  as  Gaines'  uncertainty  logic  and  Watanabe's  system,  as 
well  as  various  compound  systems  and  briefly,  other  approaches  such 
as  Shackle's  surprise  theory  and  Cohen's  inductive  probabilities. 
Applications  investigated  include  .natural  language  symbolizations, 
semantic  content  evaluations  and  some  specific  rule-based  type 
systems,  including  the  target  association  problem. 

Although  many  researchers  have  contributed  to  various  aspects 
of  uncertainty  modeling  (Zadeh,  Manes,  Gaines,  Watanabe,  Fu  among 
others),  little  attention  has  been  paid  to  developing  a  unified 
treatment  which  directly  relates  probability  concepts  with  other 
uncertainty  ideas.  One  of  our  main  tools  in  presenting  a  unified 
approach  in  this  monograph  is  random  set  theory  (as  developed  ori¬ 
ginally  by  Kendall  and  Matheron)  .  We  are  concerned  here  with  the 
application  of  random  set  theory  to  uncertainty  modeling  rather  than 
further  deve! -pment  of  random  set  theory  by  itself  (an  extensive 
area  in  its  -ight  where  much  "esearch  remains  to  be  done.). 
However  the  -ctions  established  between  the  two  areas  could  be 
useful  in  random  set  theory  proper,  such  as  the  systematic  study  of 
coverage  functions. 

The  book  is  designed  to  be  primarily  a  reference  book  which  can 
be  utilized  by  both  applied  mathematicians  interested  in  the  foun¬ 
dations  and  connections  between  modern  approaches  to  modeling  of 
imprecise  information,  as  well  as  by  systems  engineers  and  others 
interested  in  studying  expert  systems. 

Lastly,  apropos  to  the  title,  this  monograph  emphasize  the 
development  and  analysis  of  uncertainty  models  to  be  used  in  know¬ 
ledge-based  systems  and  other  appropriate  structures.  Relatively 
little  is  presented  here  concerning  the  general  theory  ‘f  knowledge- 
based  systems,  the  area  being  treated  more  appropriately  within 
Cognition  and  Artificial  Intelligence  proper.  (Gregg  [98']  and  Barr 
and  Feigenbaum  [12']  present  good  introductions  to  the  subject  and 
in  Chapters  8  and  9  and  section  2.3.5  some  aspects  of  knowj.zdge- 
based  systems  are  considered.) 
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PROLOGUE 


Although  this  monograph  is  concerned  with  various  types  of 
multi-valued  logical  systems,  as  usual,  all  descriptions  of  those 
systems  and  their  proofs,  i.e.,  the  meta-level  of  this  book,  are 
in  terms  of  ordinary  two-valued  logic  and  set  theory.  This  is 
consistent  with  the  philosophy  that  on  one  hand  describes  both 
subjective  and  objective  real-world  phenomena  (and  related  models) 
within  a  multiple-truth-valued  context,  yet,  at  the  same  time,  in 
order  to  prevent  a  possible  infinite  regress  of  nested  multi-truth 
evaluations,  the  meta-level  of  description  must  be  classical. 
Perhaps  someday  (the  authors  are  unaware  of  any  work  in  this  direc¬ 
tion)  ,  serious  textbooks  will  be  published  which  might  typically 
include:  "Pr(Theorem  A)  =  0.3"  or  "Pr ( Pr ( Theorem  A)  >  x)  =  q(x)  , 
for  most  x  in  B"  ,  with  all  "proofs"  developed  through  multi¬ 
valued  or  multi-multi-valued  logic.  However,  until  that  time 
arrives,  we  must  be  content  with  this  apparent  paradoxical  situ¬ 
ation.  As  researchers  in  probability  theory,  information  theory, 
possibility  theory  and  multi-valued  logical  systems  in  general,  we 
all  seem  to  be  collectively  stating,  to  paraphrase  the  well-known 
adage:  "Don't  do  as  I  do,  do  as  I  say!" 
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CHAPTER  1 


INTRODUCTION 


1 • 1  Motivation  and  basic  framework  of  analysis. 

Many  problems  of  common  concern  involve  the  gathering  and  pro¬ 
cessing  of  both  statistical  and  linguistic-based  information.  For 
example,  the  outcome  of  a  jury  trial  or  police  investigation  may 
well  hinge  upon  both  expert  witness  testimony,  and,  to  a  lesser 
degree,  upon  other  sources.  In  deciding  the  culpability  of  Oswald 
in  the  assassination  attempt  upon  General  Walker,  an  expert  ballis¬ 
tics  analysis  group  indicated  "could  have  come,  and  even  perhaps  a 
little  stronger,  to  say  that  it  probably  came  from  this  ...  [gun]", 
while  the  FBI  investigating  team,  as  a  matter  of  policy,  avoiding 
the  category  of  "probable"  identification,  refused  to  come  to  a 
conclusion  [256].  Other  corroborative  evidence  included  a  written 
note,  also  requiring  an  expert  verification  of  authenticity,  and 
verbal  testimony  of  witnesses.  Based  upon  this  combination  of  evi¬ 
dence,  the  Warren  Commission  concluded  that  the  suspect  was  guilty. 

The  process  involved  in  the  above  situation  could  be  analyzed 
as  involving: 

(1)  Observed  data. 

This  included  markings  on  recovered  bullets  from  the 
assassination  attempt  and  laboratory-fired  ones  from 
plausible  gun  sources  and  handwriting  from  the  note  and 
previously  established  characteristics  of  the  suspect's 
handwriting. 

(2)  Error  distributions  or  trends  of  occurances . 

This  was  indicated  by  the  reliability  of  the  data 
in  representing  what  was  observed.  Factors  that  had 
to  be  accounted  for,  included:  quality  of  the  note 
paper,  bullet  distortions,  and  reliability  of  witnesses. 

(3)  Inference  rules  which  reflect  the  legal  expertise  and  ex¬ 
perience  of  the  Commission  in  relating  the  matching  of 
evidence  with  conclusions  concerning  final  culpability. 

Another  problem  of  importance  involving  combination  of  evidence 
occurs  in  tracking  and  "correlation",  i.e.,  data  association  of 
targets:  Two  target  histories  of  interest  are  being  considered  by  a 

sensor  operator  for  possible  correlation.  He  receives  information 
labelled  as  A,  B,  C  : 

A:  Two  dimensional  position  observations  with  associated  up¬ 

dated  error  ellipses  of  some  prescribed  confidence  level. 

B:  Reports  concerning  tentative  classification  of  targets, 

such  as  Filipino  type  Q4  or  Liechtensteinian  type  R7.9. 
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C:  Visual  sightings,  including  partial  identifications,  clues, 

hull  lengths,  mast  shapes,  etc. 

Clearly,  if  information  categories  B  and  C  are  ignored, 
statistical  hypotheses  testing  theory  may  be  applied  to  A  to  es¬ 
tablish  a  standard  weighted  metric  (Mahalanobis  distance,  see,  e.g., 
Rao  [212],  Chapter  8)  for  testing  for  correlation  and  determining 
the  level  of  correlation  (statistically)  between  the  two  track  his¬ 
tories  of  interest.  Rough  gating  procedures  could  then  be  added  to 
see  if  B  and  C  confirm  or  perhaps  contradict  the  crucial  geolo¬ 
cation  criterion  results  for  A  .  (See,  e.g.,  Goodman,  et  al .  [87], 
Bar-Shalom  [12],  Reid  [213]  and  Bowman  [23]  for  descriptions  of 
tracking/correlation  systems  utilizing  the  related  procedures.) 
But,  how  should  gates  or  their  softer  distributional  analogues  be 
systematically  established  for  B  and  C  and  integrated  with  the 
results  for  A  ?  Furthermore,  can  we  use  human  in-field  operator 
experience  to  relate  in  some  way  information  matches,  mismatches, 
and  everything  in  between,  occurring  for  categories  B  ,  C  and  A  , 
as  well,  for  correlation?  This  problem  will  be  treated  in  detail 
in  Chapter  9.  Certain  aspects  of  this  problem  have  been  previous¬ 
ly  addressed  through  the  establishment  of  the  PACT  (Possibilistic 
Approach  to  Correlation  and  Tracking)  algorithm  (see  [80],  [82], 

[83],  [86]).  In  the  above  example,  the  parameter  of  interest  is  the 
true  correlation  level.  The  above  example  motivates  us  to  formulate 
the  following  scheme  for  dealing  with  parameter  estimation  problems. 
We  need  to: 

(1)  Categorize  the  incoming  information  into  subcategories  as 
A,  B,  C,  ...  .  These  categories  should  bs  carefully  chosen  for  non- 
redundancy,  if  possible.  Further  analysis  of  taxonomy  should  prove 
useful  (for  general  procedures,  see,  e.g.,  Jardine  and  Sibson  [ 1 24 • ] 
or  for  specialization  to  social  or  psychological  context,  see 
Nowakowska  [195*]). 

(2)  Establish  a  rigorous  and  systematic  framework  for  quan¬ 

tifying  information.  This  may  be  identified  with  the  problem  of 
determining  the  natural  domains  of  attributes  A,  B,  C,  ...  .  For 

example,  in  the  above  example,  A's  domain  consists  of  all  ordered 
pairs  of  2  by  l  vectors  and  ellipses,  while  for  B,  perhaps 
simplified  labels  such  as  Q4 ,  R7.9,  S6,  ...  will  do.  . 

(3)  Derive  matching  level  tables  or  equivalently  error 
"distributions"  (in  a  sense  possibly  extending  the  classical  sta¬ 
tistical  ones)  that  can  occur  between  what  is  reported,  observed  or 
predicted,  depending  on  the  context,  and  what  the  true  values  are 
for  each  attribute  category. 

(4)  Determine  relative  weights  of  importance  between  the  var¬ 
ious  attribute  categories.  For  example,  how  much  will  we  tolerate  a 
total  mismatch  with  respect  to  B  ,  when  a  relatively  good  match  oc¬ 
curs  relative  to  A  ?  This  weighting  problem,  to  a  certain  extent, 
is  addressed  implicitly  through  the  establishment  of  inference  rules 
as  discussed  below. 

(5)  Establish  logical  connections  -  based  on  either  physical 
considerations  or  human  operator  experience  -  between  the  various 
information  categories.  The  connections  could  be  either  in  the  form 
of  inference/modus  ponens  rules  or  posterior  distributions  con¬ 
straining  or  delineating  the  unknown  parameter  of  interest.  (See 
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Goodman  [82],  [86]  for  earlier  approaches  to  parameter  estimation 

when  some  of  the  informational  input  is  in  a  linguistic  format.) 

More  generally,  we  might  ask:  How  do  we  model  uncertainty  and 
conclusions  concerning  this  within  a  general  framework? 

Obviously,  once  we  remove  ourselves  from  "hard"  statistical 
information,  subjectivity  and  personal  interpretive  variation  play 
important  roles.  The  logical  connections  mentioned  in  item  (5) 
above  are  usually  given  in  the  form  of  modus  ponens  inference  rules 
or  posterior  distributions.  In  the  latter  case,  at  least  for  the 
classical  statistical  situation,  Bayes'  theorem  is  usually  invoked 
with  respect  to  more  basic  conditional  data  and  prior  distributions. 
For  the  former  case,  predicates  restricting  or  describing  the 
unknown  parameter  of  Interest  and  other  parameters  or  values  de¬ 
termined  through  various  attribute  categories  are  related  through  an 
"if  (  )  then  (  )  "  structure.  Schematically,  we  have  the  restric¬ 

tions  on  unknown  parameter  Q  ,  say: 


4>  ( Q 1 2 )  for  "If  P(Z)  then  V(Q)  " 


The  left  hand  ‘side  represents  the  posterior  distribution  (possi- 
bilistic  or  ordinary,  see  Zadeh  [281]  for  clarification  and 
explanation  of  possibilistic  distributions)  of  Q  given  data  Z  , 
while  the  right  hand  side  represents  the  inference  rule  "If  P(Z) 
is  true,  then  V(Q)  must  also  hold,"  where  P(Z)  Is  some  pre¬ 
dicate  describing  Z  and  V(Q)  is  some  predicate  describing  Q  . 

For  example,  let  Z  =  (Z(1),  Z(;)>)  ,  with  Z(i)  =  (z[1)  ,  Z^15)  and 

Z(;)1  =  (Z^,  Z^)  ,  the  superscript  i  referring  to  data  for 

track  history  i  and  j  for  track  history  j  ,  and  the  subscript 
1  referring  to  attribute  category  A  and  2  referring  to  attri¬ 
bute  B  .  Then,  we  may  let,  for  example, 


and 


P ( Z )  *  "  Z  ^ 1  *  and  Z^^  very  mildly  (or  to  a  low  level)  match 
Z^*  and  Z^'1  strongly  match". 

V ( Q )  =  "correlation  level  Q  Is  high  at  least". 


The  words  "very  mildly",  "strongly",  "high  at  least"  could,  if 
sufficient  information  were  present,  be  replaced  by  more  quanti¬ 
tative  values  such  as  ct  1  ,  a2  r  a3  ,  where  each  is  some  number 

between  0  and  1  indicating  the  intensity  of  matching  level. 
(Again,  see,  e.g.,  [80]  or  Chapter  9  for  clarification  and  elabo¬ 
ration  of  this  idea.)  It  should  be  noted  that  many  posterior 
distributions  and/or  inference  rules  may  be  present  which  restrict 
the  possible  values  of  Q  given  Z  . 

With  the  problem  modeled  according  to  the  above  mentioned 
scheme,  the  next  basic  question  concerns  the  actual  mathematization 
or  translation  of  the  problem  into  symbols  which  can  be  manipulated 
according  to  some  established  calculus.  How  do  we  translate  the 
problem  into  a  consistent  rigorous  framework?  What  means  do  we  use 
to  translate  the  atomic  or  fundamental  Information  parts?  How  are 
compound  informational  parts  to  be  exhibited  through  appropriate 
choice  of  operators?  Which  parts  of  the  problem  are  more  amenable 
to  ordinary  statistical/probabilistic  analysis  and  modeling,  and 
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which  parts  to  possibilistic  modeling?  Certainly,  natural  language 
descriptions  appear  more  easily  put  into  possibilistic  structure 
than  classical  probabilistic  ones.  (See  Dubois  and  Prade  [51]  for  a 
survey.)  On  the  other  hand,  even  concepts  that  may  appear  statis¬ 
tically  describable  may  also  be  modeled  via  a  dispersion  theory. 
For  example,  classification  often  entails  rather  overlapping  pos¬ 
sible  values  or  classes,  based  on  various  contributing  factors.  In¬ 
deed,  some  classes  may  actually  be  subsets  of  others.  In  addition, 
it  is  possible  that  the  exact  definitions  of  the  classes  themselves 
are  vague,  and  moreover,  the  relations  between  the  classes  may  not 
be  clear.  For  example,  B  could  contain  E  and  F  ,  which  are 
defined  by  knowledge  of  frequencies,  ship  size,  shape,  and  number  of 
emitting  energy  sources  on-board.  Overlaps  between  E  and  F  may 
abound.  Thus,  it  is  not  appropriate  to  consider  ordinary  probabil¬ 
ity  distributions  over  B  since  the  elementary  events  E  and  F  , 
among  others  for  example,  are  not  distinct  or  disjoint.  Rather, 
because  of  the  overlapping  flavor  of  B  ,  either  random  sets  of  B 
or,  more  generally,  certain  equivalent  classes  of  random  subsets  of 
B  should  be  used.  (Equivalence,  as  used  here,  is  in  the  sense  of 
having  the  same  one  point  coverage  function,  see  Chapter  5.) 

Analysis  of  uncertainty  in  its  most  general  form  requires 
modeling  and  measuring  imprecise  concepts  expressed  by  natural  lan¬ 
guage.  By  natural  language  we  mean  that  medium  through  which  all 
human  ideas  are  formed,  including  classical  set  theory  and  two¬ 
valued  logic,  as  well  as  more  ambiguous  concepts,  such  as  proba¬ 
bility  distributions  representing  measurements,  and  the  still  more 
ambiguous  terms  occurring  in  ordinary  speech  descriptions  such  as 
"tall",  "happy",  "within  3  units  of",  "close  to",  “almost  all", 
"there  exists",  "approximately  a  subset  of  K  to  degree  0.6", 
"member  of  set  G" ,  "approximately  a  member  of  H  to  degree  0.4". 

The  difficulty  in  using  natural  language  modeling  as  a  direct 
tool  for  analysis  is  emphasized  by  the  lack  of  organization  of  the 
field.  Despite  the  heavy  influence  of  pioneer  linguists  Sapir, 
Bloomfield,  Jespersen,  Boas,  Whorf  (establishing  the  famous  Whorf- 
Sapir  Hypothesis  on  language  restricting  the  thoughts  of  native 
speakers),  and  the  later  work  of  formal  linguists  Hiz,  Harris,  and 
Chomsky,  among  others  (see  [35],  [107]),  what  is  evident  is  that  a 
unified  theory  of  linguistics  entailing  both  semantics  (meaning)  and 
syntactics  (operations  or  form)  Is  needed  which  is  suitable  for 
complete  mathematization .  (One  candidate  approach  is  due  to  Zadeh 
[290]  using  fuzzy  set  theory.  See  also  the  work  of  Grenander  for  a 
different  perspective,  [98]). 

The  basic  analysis  of  uncertainty  modeling  revolves  around  a 
series  of  general  topics: 


a)  Internal  vs  external  modeling. 

In  the  internal  approach,  explicit  analytic  relations  are 
sought  connecting  one  approach  of  uncertainty  to  another.  For 
example,  Negoita  and  Ralescu,  through  their  representation  theorem 
[184]  tied  up  very  neatly  classical  fuzzy  set  operations  with  "flou" 
(equivalently,  level)  set  operations.  As  another  example  of  the 
internal  approach,  Goodman,  Orlov,  Nguyen,  and  Htthle  among  others 
(see  [90],  [197],  [190],  [116])  demonstrated  direct  connections 
between  random  sets  and  fuzzy  sets  and  certain  of  their  operations. 

In  the  external  approach,  unifying  generalizations  are  sought 
which  reduce  to  various  approaches  to  uncertainty  modeling.  Here 
the  work  of  Hirota  [111],  Schefe  [224]  and  Gaines  [68]  may  be  cited 
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for  developing  structures  that  simultaneously  generalize  probability 
and  fuzzy  set  systems. 

The  most  far  reaching  work  in  this  area  is  due  to  Manes  [169] 
who  derived  a  collection  of  axioms  which  not  only  generalize  pro¬ 
bability  theory  and  Zadeh's  (min-raax)  fuzzy  set  theory  [276' ],  but  a 
whole  host  of  other  systems,  including  topological  neighborhood 
theory  and  credibility  theory.  (However,  see  Chapter  7  for  details 
on  Manes'  work,  where  it  is  shown  certain  restrictions  must  be 
imposed  on  general  logical  systems  to  satisfy  Manes'  axioms.) 


b)  Prelinquistic  concepts  and  ideas  obtained  through 
natural  language.  Cognition. 

This  topic  concerns  itself  with  the  ability  of  natural  language 
to  express  ideas  accurately  and  succinctly  as  well  as  the  form¬ 
alizing  mathematiz ing  of  natural  language  for  dealing  with  uncer¬ 
tainties.  Comments  were  made  previously  on  the  lack  of  progress  in 
this  extremely  difficult  area.  Ironically,  we  can  express  in  a  few 
words,  ideas  such  as  love,  happiness,  temporal  vague  concepts,  am¬ 
biguous  descriptions  -  which  are  perfectly  understandable  to  another 
reasonably  educated  speaker,  as  well  as  various  combinations  and 
operations  on  these  ideas  -  yet  we  cannot  express  these  concepts 
easily  within  a  rigorous  frame-work  in  terms  of  all  the  component 
primitive  or  atomic  parts.  On  the  other  hand,  "complicated"  mathe¬ 
matical  terminology  such  as  is  typically  found  in  category  theory  or 
algebraic  topology  or  deductive  logic  studies,  really  express  con¬ 
cepts  far  simpler  in  nature  than  what  language  can  express.  (Of 
course,  we  cannot  discount  the  ability  of  language  to  represent  - 
albeit,  how  awkward  -  pure  mathematical  concepts.) 


c)  Concept  of  dispersions. 

It  is  the  firm  conviction  of  both  authors,  as  well  as  others, 
(see  Goodman  [89],  [90])  that  because  mathematical  analysis  has 
shown  that  fuzziness  is  a  weak  form  of  randomness,  i.e.,  a  looser 
type  of  randomness  without  the  constraints  of  a  probability  distr¬ 
ibution  entailed,  dispersions  and  their  operations  (see  Chapter  2) 
are  natural  tools  to  express  linguistic  concepts  rather  than  pro¬ 
bability  distributions.  Thus,  the  fundamental  idea  of  a  point 
partially  belonging  to  a  set  with  degree  specified  as  some  number 
between  0  and  1  may  well  be  taken  as  an  intuitive  concept 
representing  the  possibility  that  the  point  is  in  the  set,  rather 
than  the  probability  that  it  is  in  the  set  (the  set  is  now  consi¬ 
dered  as  a  random  set).  See  Chapter  5  for  the  development  of 
explicit  relations  between  dispersion  and  random  sets.  Essentially, 
a  dispersion  is  equivalent  mathematically  to  the  class  of  all  random 
sets,  which  have  in  common  the  same  one  point  coverage  function, 
namely  the  dispersion  itself. 


d)  Development  of  general  logical  systems  and  problem  of 
ambiguities . 

This  topic  is  basic  in  developing  a  unified  approach  to  uncer¬ 
tainty  modeling.  Too  often  in  the  past  (e.g.,  Dubois  and  Prade 
[51])  myriad  distinct  logical  systems  have  been  proposed  for  use  in 
modeling  uncertainties,  without  paying  attention  to  the  inherent 
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ambiguity  of  definitions  present.  More  specifically,  consider  the 
problem  of  defining  an  appropriate  concept  for  the  intersection  of 
two  generalized  sets.  Originally,  Zadeh  proposed  that  minimum  as  an 
operation  on  che  respective  membership  functions  was  the  most  appro¬ 
priate.  Lator,  Bellman  and  Giertz  [14]  were  among  the  first  to 
justify  on  a  rigorous  basis  the  use  of  minimum  as  an  intersection 
operation.  (See  also  the  survey  of  Klement  [140]  on  rigorous 
characterizations  of  various  fuzzy  set  operations,  including  inter¬ 
section,  union  and  complementation. )  However,  the  justification 
required  certain  constraints  (such  as  mutual  distributivity )  which 
are  not  realistically  required  within  a  general  setting.  Other 
definitions  for  intersection  resulted,  including  the  use  of  product, 
also  justified,  with  again  appropriate  restrictions  (again  see 
Dubois  and  Prade  [51]  for  a  survey).  Clearly,  minimum  and  product, 
while  both  extending  ordinary  intersection  (relative  to  zero-one 
type  membership  functions)  are  considerably  different.  Zadeh' s 
original  fuzzy  set  system  and  its  many  generalizations  are  all 
truth-functional  systems,  i.e.,  evaluations  of  logical  relations 
depend  on  functional  compositions  as  does  intersection,  in  parti¬ 
cular.  Other  non-truth-functional  definitions  for  intersection 
could  be  used  such  as  that  for  probabilistic  logic.  (See  Chapter  2 
for  further  details.)  Which  one  to  choose  or  not?  The  answer  tc 
this  problem  may  well  lie  in  defining  an  entire  class  of  dispersion 
operations  -  not  just  a  single  operation  -  which  in  the  most  natural 
way  abstracts  the  ordinary  concept  of  intersection .  At  least  for 
truth- functional  systems,  such  a  class  has  been  proposed.  (See, 
e.g.,  Klement  [140],  Goodman  [88]).  This  class  is  called  the 
t-norms,  a  term  borrowed  from  a  branch  of  probability  theory  totally 
independent  of  dispersion  theory  and  developed  by  Schweizer  and 
Sklar  [226]  based  on  an  earlier  proposal  of  Menger .  These  operators 
are  symmetric,  associative,  usually  assumed  also  continuous,  obey 
certain  boundary  conditions  for  compatibility  with  ordinary  inter¬ 
section,  are  nondecreasing  in  their  arguments,  and  numerically  are 
bounded  above  by  (the  largest  t-norm)  the  minimum  operation.  (See 
again,  Schweizer  and  Sklar  [226].  See  also,  Haack  [101]  and  Rescher 
[214]  for  listings  of  multi-valued  logical  systems  where  special 
cases  of  t-norm  and  t-conorm  operators  are  used  for  "and"  and  "or"  . 
The  associativity  anl  symmetry  conditions  above  may  be  dropped,  with 
some  loss  of  structural  properties  in  modeling  conjunction.  (See 
sections  Z.3.5,  2.3.6.) 

Similarly,  other  classes  of  definitions  may  be  developed  ior 
union  (t-conorms),  complement  (negations,  often  in  the  form  of 
involutions),  and  in  turn,  these  general  definitions,  through  multi¬ 
valued  logic,  may  be  used  to  develop  general  compound  dispersion 
operations  and  relations,  including  implication,  the  quantification 
"for  all",  and  "there  exists,"  as  well  as  various  intermediate 
quantifications  such  as  "most",  "few",  and  general  subset  and 
arithmetic  relationships.  In  addition,  this  leads  to  the  general 
concept  of  conditional  dispersions  or  equivalently  condi t'onal 
attributes  (analoguous  to  conditional  probability  distributions),  a 
form  of  Bayes'  theorem  and  in  turn,  a  theory  of  sampling  based  on 
dispersions  (Goodman  [86]).  (See  also  sections  8.2,  2.6  and  the 
appendix  at  the  end  of  Chapter  10.) 

Even  with  the  general  unifying  approach  as  described,  problems 
of  ambiguitv  of  definition  still  arise.  For  example,  the  quantified 
expression  "for  all  x  In  X  ,  if  x  has  property  A  then  x  has 
property  B  "  is  definable  by  the  relation  as  expressed  in  English 
by  the  repeated  form 
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"  <(X1  e  A)  a  B>  oi  <(X2  e  A)  »  B>  &,... 
<(Xj  «  A)  =»  B>  means  "if  Xj  is  in 
equivalently  "if  Xj  has  property  A 
The  Xj  vary  over  the  entire  universe 


,  &  c(xn  e  A)  »  B>  "  where 
A  then  Xj  is  in  B  "  or 
then  Xj  has  property  B  "  . 
of  discourse  X  and  A  and 


B  need  not  represent  ordinary  sets  but  some  general  attributes 
corresponding  to  dispersions.  On  the  other  hand,  the  same  concept 
could  conceivably  be  expressed  directly  by  a  unary  operation  on  a 
properly  chosen  conditional  form  or,  alternatively  on  the  general- 

1  n 

ized  cardinality  -  x  $ ,  .  n.(x.)  ,  where  n  =  Card(X)  .  This 

n  T(A»B)  j 


unary  operation  is  a  generalization  of  "for  all",  i.e.,  a  monotoni- 
cally  increasing  function  over  the  unit  interval  which  rises  sharply 
towards  one  near  domain  value  one  and  which  otherwise  is  zero  before 
these  values.  Similarly,  monotonic  operations  can  be  used  in  dis¬ 
persion  tneory  to  define  "almost  all",  "at  least  most",  etc.  It  is 
easy  to  see  that  in  general ,  though  again  two  concepts  extend  the 
ordinary  meaning  in  zero-cne  logic  set  theory  of  "for  all,"  they 
represent  two  different  approaches  to  the  universal  quantifier.  It 
should  be  noted  that  the  two  approaches  to  defining  the  universal 
quantifier  depend  upon  the  logical  system  chosen.  Thus,  in  a  truth- 
functional  context,  the  choices  of  t-norra  or  copulas  representing 
conjunction,  complement  operators  representing  negation,  and 
t-conorms  and  co-copulas  representing  disjunction  all  play  a  key 
role.  Which  ones  to  choose?  Also,  there  is  the  fundamental  problem 
in  modeling  dispersions  corresponding  to  the  original  linguistic 
concept.  What  individual  variation  of  response  should  be  allowed? 
How  specific  should  the  universe  of  discourse  be?  For  example,  when 
considering  the  attribute  “long",  do  we  consider  ships,  cars  or 
both?  Is  there  some  errand  scale  where  "long"  can  be  quantitatively 
established  through  a  lispersion,  other  that  the  obvious  fact  that 
It  is  some  monotone  increasing  dispersion?  This  problem  is  analo¬ 
gous  to  that  of  modeling  an  appropriate  probability  distribution: 
Parameterize  the  allowable  family  of  dispersions  and  then  choose  the 
most  appropriate  value  or  values  of  the  parameter  (and  hence,  cor¬ 
responding  dispersion  from  the  family  through  some  estimation 
technique  based  on  empirically  obtained  evidence).  (See  Lakoff 
[150],  Hersh  and  Caramazza  fl09],  Zlmmermann  [291],  e.g.,  for  vari¬ 
ous  approaches  to  the  modeling  of  such  functions.)  Finally,  the 
functional  extension  problem  should  be  mentioned,  where  ordinary 
functions  or  relations  between  sets  are  extended  to  (still  ordinary) 
functions  between  generalized  sets.  Even  if  this  ambiguity  is 
resolved  other  problems  arise  in  designing  how  dispersions  should  be 
logically  combined.  More  specifically,  in  classical  logic,  any 
multiple-argument  (with  zero-one  values)  Boolean  truth  function  f 
(polynomial)  may  be  expressed  as  f  =  9o(ll,n0t  <  $an<3  $or)  where 

d>  „  ,  <t>  ,  ,  d>  are  the  classical  Boolean  functions  (by  the  com- 

pleteness  propertv  of  the  above  operations  -  see,  e.g.,  Enderton 
[54]).  Then  f  can  be  extended  to  have  a  domain  and  range  where 
(0,1)  is  replaced  by  7  [0,1]  .  In  turn,  f  may  be  used  as  Manes 

[169]  and  Gaines  [68]  proposed  to  combine  dispersions  -  on  the  other 
hand,  the  general  logical  operators  for  negation,  conjunction  and 
disjunction  as  discussed  earlier,  combine  dispersions  differently, 
(See  Chapter  2.3.6  for  details.) 
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e)  Relationships  between  dispersions  and  their  operations  and 
probabilistic  concepts. 

The  details  of  these  relationships  will  be  spelled  out  in 
Chapters  5  and  6.  Recalling  the  last  comments  of  subsection  c), 
given  a  dispersion,  it  may  be  expeditious  to  choose  one  particular 
random  set  one  point  coverage  equivalt.it  to  it.  Which  one  to 
choose?  How  much  information  loss  occurs  when  one  random  set  is 
chosen  as  opposed  to  the  entire  equivalence  class?  Could  some 
mathematical  criterion  be  used  to  weed  out  this  random  set  -  such  as 
maximum  entropy?  What  about  semantic  content?  For  example,  consi¬ 
der  the  simple  attribute  "tall".  Clearly,  this  is  represented  by  a 
monotone  increasing  dispersion,  i.e.,  the  membership  function  of  the 
attribute  "tall"  must  be  monotonically  increasing.  However,  it  can 
be  shown  that  among  the  equivalent  random  sets  for  any  such  monotone 
increasing  dispersion,  two  considerably  different  random  sets  can  be 
constructed:  The  Sy-type  ,  which  is  a  random  interval  with  right 

end  point  fixed  at  the  maximal  element  in  the  universe  of  discourse, 
and  the  T-type  ,  which  has  a  highly  disconnected  sturcture,  not  an 
interval.  Clearly,  the  first  is  more  compatible  with  the  concept  of 
"tall"  -  if  one  point  is  covered  randomly  by  such  a  random  set  re¬ 
presenting  "tall",  shouldn't  all  points  to  the  right,  representing 
larger  heights  also  be  covered?  Similarly,  there  may  be  a  most 
natural  choice  of  random  set  representation  for  a  given  attribute, 
when  the  corresponding  dispersion  is  of  some  prescribed  type,  such 
as  unimodal,  continuous,  discrete,  or  is  in  the  form  of  a  step 
function? 

In  a  related  vein,  we  may  pose  the  question  as  how  should 
semantic-based  information,  and  thus  dispersions,  be  combined  with 
independently  derived  statistical  information  concerning  a  common 
unknown  parameter  vector?  Finally,  it  is  of  importance  to  ascer¬ 
tain,  through  the  relationships  mentioned  above,  if  random  set 
theory  could  be  used  to  derive  properties  for  dispersions. 


1 . 2  Background  and  survey. 

A  key  problem  currently  faced  in  AI  (Artificial  Intelligence) 
and  related  systems  is  the  determination  of  the  most  appropriate  way 
to  model  and  integrate  natural  language  information  with  numerical 
and  statistical  information.  There  is  no  question  that  the  solution 
to  this  problem  must  -  at  the  very  least  -  draw  upon  the  fields  of 
cognitive  learning  processes  and  psychometric  theory,  formal  lin¬ 
guistics,  including  an  integrated  view  of  semantics  and  syntactics, 
probability  and  generalized  forms  of  measure  theory,  and  both 
classical  and  multivalued  logic.  It  is  our  belief  here  that  an 
encompassing  theory  of  uncertainty  can  be  established  employing  all 
of  the  above  mentioned  disciplines  which  may  successfully  treat  this 
basic  problem  in  AI . 

Some  Investigators  in  the  field  of  cognitive  rationality  have 
come  to  the  conclusion  that  rational  decision  making  by  humans  is 
really  unattainable  [128].  Others,  such  as  Cohen  [38],  hold  the 
belief  that  Intuition,  properly  systematized,  can  serve  as  a  basis 
for  the  choice  of  a  normative  theory  of  decision  making.  In  turn, 
the  latter  depends  critically  upon  which  fundamental  theory  or 
theories  of  uncertainties  and  belief  to  accept.  Polemics  aside, 
techniques  will  have  to  be  developed  which  in  some  reasonable  sense, 
model  uncertainties  as  faithfully  as  possible  so  that  applications 
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to  decision  making  -  such  as  the  reconciliation  of  incoherent  data 
as  discussed  by  Brown  and  Lindley  [25]  -  may  be  carried  out  on  both 
the  basic  intuitive  level  and  normative/rigorous  level.  The  phi¬ 
losophy  of  approach  here  is  optimistic:  It  is  the  opinion  of  the 
authors  that  Cohen's  remarks  are  valid,  despite  the  temptation  to 
present  analogues  (well-worn)  with  Heisenberg's  uncertainty  prin¬ 
ciple  in  quantum  mechanics. 

Until  relative^  recent  times,  little  systematic  effort  was  ex¬ 
pended  in  developing  a  rigorous  scientific  approach  to  the  analysis 
of  natural  language.  This  does  not  belittle  the  extremely  important 
contributions  of  Whorf  and  Sapir  (Whorf-Sapir  Hypothesis)  on  lan¬ 
guage  influencing  logic  (see  [163]),  Bloomfield's  ground-breaking 
work  [20]  nor  Carnap's  attempts  at  formalizing  language  influencing 
logic  [26],  [27].  Nevertheless,  it  was  not  until  the  1950's  and 
1960 ' s ,  beginning  with  Chomsky  [34],  [35],  Harris  [107]  and  others, 
that  a  rigorous  approach  was  taken  to  the  analysis  of  syntactic 
transforms  and  relations  in  natural  language.  At  present,  many 
diverse  viewpoints  (see,  e.g.,  [47])  mark . the  field  of  formal  lin¬ 
guistics  (see  [173]  for  surveys  illustrating  several  conflicting 
viewpoints).  In  the  area  of  semantics,  similar  diverse  viewpoints 
have  sprung  up.  (See,  e.g.,  Lyons  [163]  and  Leech  [153]  as  well  as 
earlier  interesting  empirical  studies  of  Osgood,  et  al .  ,  [198]  and 
the  independent  recent  theories  of  Grenander  [98].)  In  addition, 
statistical  techniques  have  been  used  in  analyzing  various  linguis¬ 
tic  problems  such  as  message  rates,  speech  patterns  and  comparisons 
of  literary  styles  [108].  Nevertheless,  relatively  little  effort 
within  the  large  body  of  present  day  formal  linguistics  has  been 
devoted  to  analyzing  natural  language  from  a  multi-valued  logic 
viewpoint . 

It  is  our  opinion  that  more  emphasis  must  be  placed  upon  inte¬ 
grating  semantics  with  syntactics  within  a  general  logic  viewpoint. 

Beginning  in  the  1920 's  with  the  Polish  school  of  multi-valued 
logic  (see  Rescher  [214]  for  an  excellent  survey  of  the  field), 
other  related  logics  were  developed,  including  modal  logic  [245], 
[28],  intuitionistic  logic  [110]  and  temporal  logic  [215],  [173']. 
(See  Haack  [100],  [101]  for  a  survey  of  various  systems.)  These 
logical  systems  (especially  all  of  the  variants  of  Lukasciewicz 
Logic)  were  developed  because  of  the  apparent  inability  of  classical 
logic  to  model  degrees  of  truth  and  uncertainty.  It  was  not  until 
Zadeh's  pioneering  work  in  1965  [276‘],  and  apparently  independent¬ 
ly,  Klaua's  similar  (but  not  as  well  known)  development  in  1965 
[139],  that  comprehensive  models  of  sets  based  upon  multi-valued 
logic  were  established.  Earlier  work  in  this  area  include:  Black's 
"consistency  profiles"  [19];  Shirai's  [234],  and  independently, 
Skolem's  set  theory  [239]  both  based  upon  three-valued  logic; 
Borel's  ideas  [22]  -  also  discussed  in  Godal  and  Goodman  [75]; 
Sheppard's  quantification  of  linguistic  concepts  [233];  Watanabe's 
continuous  set  membership  function  applied  to  quantum  mechanics 
[268];  Gilmore's  partial  sets  [74];  Chang's  infinite-valued  set 
theory  [31];  and  Zinov'ev's  [292]  and  Korner's  contributions  [145]. 
(See  also  Maydoie's  treatise  on  many-valued  logic  as  a  basis  for  set 
theory  [ 172  ]  .  ) 

Since  Zadeh's  first  work,  he  and  others  have  developed  exten¬ 
sive  theories  and  applications  of  fuzzy  sets  based  upon  infinite¬ 
valued  lcgic  -  especially,  with  truth  values  in  the  unit  interval, 
or  more  generally  within  some  lattice-like  structure.  Particular 
emphasis  has  been  on  the  development  of  fuzzy  logic  and  the  inter¬ 
pretation  of  natural  language  [280],  [283],  [286]  -  [290].  In 
addition,  other  schools  of  thought  have  arisen  concerning  modeling 


10 


Goodman  and  Nguyen 


of  sets  and  uncertainties,  including  Dempster-Shaf er 1 s  belief  theory 
[46],  [230],  Cohen's  inductive  probability  theory  [37],  Shackle's 

degree  of  surprise  concept  [222]  as  well  as  many  variants  on  classi¬ 
cal  and  subjective  probability  theory.  (See,  e.g.,  Lindley,  Tversky 
and  Brown  [157]  for  second  order  probabilities  and  Fine  [58]  for  an 
extensive  survey  of  approaches  to  probability  models.)  Freeling 
[64]  has  recently  made  some  comparisons  of  these  schools.  The 
Dempster-Shaf er  theory  is  motivated  to  a  large  degree  by  Dempster's 
original  theorem  [46],  [230]  establishing  "upper"  and  "lower"  proba¬ 
bility  uncertainty  measures  bounding  the  actual  induced  probabil  ity 
measure  by  a  functional  transform  of  a  random  variable  -  when  the 
transform  is  known  only  up  to  coverage  set  values  (resulting  in  a 
random  set).  Somewhat  analogous  is  the  work  of  Bertsekas  and  Rhodes 
[16]  and  Schweppe  [225]  in  using  set  estimates  in  filtering  pro¬ 
blems.  The  uncertainty  measures  (yielding  all  four  Choquet  capacity 
forms  [90])  consist  of  equality-inequality  extensions  of  the 
Poincar6  expansion  of  the  probability  of  the  union  of  a  collection 
sets  in  terms  of  the  alternating  sum  of  the  probabilities  of  the 
intersections  of  all  subclasses  of  the  collection,  and  the  dual 
expansion.  The  Choquet  capacity  theorem  [36]  -  as  demonstrated  by 
Nguyen  [187]  (see  also  Goodman  [90])  shows  that  upper  and  lower 
probability  -  as  well  as  other  associated  measures  from  the 
Dempster-Shaf er  theory  -  may  all  be  represented  through  subset  and 
superset  coverage  and  incidence  functions  for  uniquely  determined 
random  sets  associated  with  these  measures.  Indeed,  there  is  a 
relatively  simple  simultaneous  bijective  correspondence  between  the 
class  of  all  possible  random  subsets  (distinct,  relative  to  distri¬ 
butional  considerations)  of  a  given  space  and  each  of  the  four 
classes  of  uncertainty  measures  [90],  [116],  (See  also  Chapter  3.) 
A  list  of  various  uncertainty  measures  is  presented  in  Dubois  and 
Prade  [51].  (See  again  Chapter  3.)  See  also  the  recent  interesting 
arguments  between  Bayesian  and  Dempster-Shaf er  proponents  over 
Shafer's  approach  [231]  to  the  Lindley  "paradox"  and  in  a  connected 
vein,  see  Lindley' s  claim  [158]  that  subjective  probability  theory 
is  the  only  appropriate  approach  to  uncertainty  modeling.  (However, 
see  our  discussions  on  Lindley 's  claim  in  Chapter  10.) 

Random  set  theory  and  its  modifications,  (such  as  through  equi¬ 
valence  classes  of  random  sets),  could  well  provide  a  key  to  a 
meaningful  analysis  of  the  many  approaches  to  the  modeling  of  un¬ 
certainties  by  the  establishment  of  rigorous  relationships  between 
the  theory  of  perception,  natural  language  descriptions,  multi¬ 
valued  truth  and  set  theory,  and  other  disciplines.  The  intuitive 
basis  for  the  use  of  random  sets  appears  only  slightly  more  compli  - 
cated  than  that  for  ordinary  random  variables.  Indeed,  as  some 
results  in  this  text  show,  there  is  a  natural  relationship  between 
random  sets  and  random  variable  representations  of  dispersions 
( Chapter  5.2). 

If  dispersion  theory  and  associated  calculus  are  to  become  a 
useful  approach  to  knowledge-based  systems,  not  only  must  feasi¬ 
bility  of  implementation  be  shown  (as  the  many  applications  in 
literature  indicate  -  see,  e.g.,  Dubois  and  Prade  for  a  listing 
[53]),  but  also  a  sound  theoretical  foundation  must  be  established. 
Moreover,  it  is  imperative  to  determine  what  direct  relationships, 
if  any,  are  there  between  dispersion  theory  and  existing  well- 
established  theories  such  as  probability  theory  and  set  theory. 

Four  steps  toward  the  above-mentioned  goals  will  be  discussed 
in  the  text.  In  summary: 

1.  General  logical  systems  can  be  constructed  which  contain 
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classical  probability  logic.  (See  section  2.3.5.)  On  the  other 
hand,  any  attribute  may  be  represented  by  the  class  of  all  random 
subsets  of  the  attribute's  domain  space  which  have  a  common  one 
point  coverage  function  coinciding  with  the  dispersion  for  the 
attribute.  A  related  result  holds  with  respect  to  random  variables. 
In  addition,  isomorphic-like  relations  (i.e.,  isomorphic  relations 
modified  for  equivalence  in  the  one  point  coverage  sense)  exist 
between  large  classes  of  dispersion  operations  and  ordinary  corres¬ 
ponding  operations  on  random  sets.  In  general,  each  ordinary  set 
operation  on  random  sets  has  many  isomorphic-like  dispersions  ex¬ 
tensions.  (See  Chapter  6.) 

2.  Dispersions  may  be  derived  as  truth  function  realizations 
within  the  unit  interval  of  truth  values  of  compound  predicates 
involving  basically  memberships  of  points  in  sets.  These  compound 
predicates  -  before  evaluation  -  are  in  the  main,  motivated  by,  and 
have  the  same  forms  as,  those  in  classical  two-valued  logic. 
Alternatively,  a  topos  (or  topos-like)  structure  (Johnstone  [127], 
Fourman  and  Scott  [59])  encompassing  dispersion  theory  can  be 
established  so  that  all  classical  concepts  can  be  put  into  a  formal 
language  which  is  realizable  through  that  structure  and  may  be 
interpreted  as  generalizations  of  the  classical  concepts  (Eytan 
[56],  [57],  Ponasse  [204]).  (See  section  2.4.) 

3.  Natural  language  expressions  may  be  symbolized,  and  sub¬ 
sequently  analyzed,  in  an  efficient  manner  by  use  of  dispersion 
operations,  often  without  the  constraints  posed  by  a  probability 
formulation.  Rather,  more  compatible  with  the  often  vague  and 
non-numerical  nature  of  language,  an  equivalence  class  of  probabil¬ 
istic  models  (more  specifically,  random  sets)  should  be  used,  which 
in  effect  is  the  same  as  simply  employing  a  dispersion  description. 
A  number  of  examples  will  be  presented  in  section  2.6  showing  how  a 
variety  of  different  sentence  types  may  consequently  be  interpreted 
within  a  dispersion  formulation.  (See  also  the  appendix  in  Chapter 
10.) 


4.  By  making  use  of  the  results  presented  in  this  monograph,  a 
detailed  application  is  given  in  Chapter  9,  demonstrating  how  dis¬ 
persion  theory  may  be  used  as  a  vehicle  in  the  symbolization  and 
utilization  of  natural  language  information  in  conjunction  with  any 
probabilistic  information  present. 


1 . 3  Organization  of  the  Monograph. 

The  book  is  composed  of  ten  chapters.  The  bibliography  of 
those  works  cited  here  is  given  ci  t  the  end . 

With  respect  to  the  mathematics  involved,  a  standard  knowledge 
of  mathematical  logic,  measure  theory,  probability  theory  and  mathe¬ 
matical  statistics  is  required.  In  Chapter  2,  some  topos  theory  and 
algebraic  logic  background  is  necessary.  Since  the  book  is  moti¬ 
vated  and  developed  towards  applications,  familarity  with  systems 
concepts  may  also  be  useful . 

Graduate  students  in  applied  mathematics,  engineering  science, 
and  mathematical  statistics  should  find  no  great  difficulty  in 
reading  the  treatise. 

This  chapter  has  presented  some  motivations  for  studying  un¬ 
certainty  models,  as  well  as  a  framework  of  analysis,  and  a  brief 
survey  of  the  field.  Chapter  2  deals  with  the  concepts  of  natural 
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and  formal  language,  semantic  evaluations,  multiple-valued  logic  and 
set  theory  and  general  logical  systems.  General  logical  systems 
encompass  both  (i)  truth-functional  systems  that  extend  Zadeh's 
and  other  concepts  of  fuzzy  set  theory  and  ( i i )  classical  pro¬ 
bability  logic  (a  semi-truth  functional  system)  and  a  natural 
extension  (Lebesgue-Stielt jes  Logic).  Also  in  Chapter  2,  examples 
of  truth  evaluation  of  natural  language  are  shown.  (Because  of  the 
number  or  topics  treated,  which  are  important  for  the  development  of 
the  remaining  portion  of  the  text,  Chapter  2  is  of  much  greater 
length  than  the  other  chapters.)  Chapter  3  treats  useful  classes  of 
uncertainty  and  the  concept  of  dispersions.  Chapter  4  gives  an 
introduction  to  the  well-established  (but  relatively  little  known) 
theory  of  random  sets  needed  for  subsequent  investigations.  In 
Chapter  5,  a  systematic  study  of  the  connection  between  random  sets 
and  dispersions  is  carried  out.  Following  this,  Chapter  6  develops 
the  isomorphic-like  connections  between  general  logical  systems  and 
random  set  models.  Chapter  7  discusses  some  relevant  theories  of 
uncertainty,  including  the  ideas  of  Manes,  Watanabe  and  Gaines. 
Chapter  8  develops  rules  of  inference  based  upon  techniques  of 
general  logical  systems  investigated  in  preceeding  chapters.  These 
rules  can  be  applied  to  the  problems  of  knowledge  representation, 
combination  of  evidence  and  decision  making  in  knowledge-based 
systems.  Chapter  9  is  a  detailed  application  of  the  techniques 
developed  in  this  monograph  to  a  specific  problem,  namely  the  target 
data  association  problem. 

A  summary  of  the  chief  results  of  the  text  is  given  in  Chapter 
10.  This  chapter  also  contains  the  discussion  of  research  issues 
and  future  directions. 


CHAPTER  2 


SYMBOLIZATION  AND  EVALUATION  OF  LANGUAGE 


2 . 1  Introdactlon 

One  of  the  most  distinguishing  features  of  mankind  with  respect 
to  other  life  forms  is  the  ability  to  develop  and  formalize  abstract 
concepts  and  ideas  (the  recent  chimpanzee-ape  language  controversy 
notwithstanding  -  see  [681]).  Natural  language  is  the  initial  ve¬ 
hicle  through  which  these  ideas  and  concepts  are  expressed.  The 
most  complex  mathematical  expressions  are  meaningless  unless  they 
represent  concepts  at  least  theoretically  expressable  in  natural 
language.  Thus  in  summary.  It  is  not  unreasonable  to  state  that  all 
ideas  may  be  posed  within  a  natural  language  format.  In  many  cases, 
vagueness  or  ambiguity  of  descriptions  are  not  easily  captured  by 
rigid  mathematical  expressions,  but  are  more  conveniently  repre¬ 
sented  in  natural  language. 

In  many  real-world  problems,  information  arrives  in  the  form  of 
natural  language  expressions  such  as  "if  the  skin  color  is  pink, 
jaundice  may  be  present"  or  "most  blue  birds  have  feathered  wings". 
Thus,  natural  language  as  interpreted  here  is  capable  of  both 
expressing  exact  concepts  as  well  as  ambiguous  or  vague  ideas. 

In  order  to  make  such  problems  more  precise  and  to  establish 
rigorous  solutions,  a  formal  language  for  meaning-representation 
must  be  established,  wherein  all  natural  language  concepts  are 
carried  over  into  semiotic  forms,  i.e.,  strings  of  symbols,  appro¬ 
priately  formed.  The  semantic  evaluation  of  these  strings  reflects 
the  degrees  of  uncertainty  associated  with  concepts  represented  by 
the  strings.  Ideas  which  are  "absolutely  true"  or  certain  are  eval¬ 
uated  as  T  or  unity.  On  the  other  hand,  concepts  such  as  "John  is 
quite  tall"  or  "an  orange  is  big"  may  have  less  than  unity  truth 
values  associated  with  them,  if  the  relevant  logic  is  not  just 
restricted  to  {0,1}  . 

This  chapter  establishes  a  unified  framework  connecting  natural 
language,  formal  language,  and  semantic  evaluations  of  expressions. 

The  following  diagram  in  Figure  I  illustrates  roughly  the  flow 
of  knowledge  from  conception  to  evaluation: 
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Figure  I .  Flow  of  knowledge  from  conception 
to  evaluation. 


2.2  Natural  and  formal  languages 
2.2.1  Preamble 

As  mentioned  before,  all  thoughts  of  man  are  enunciated  through 
natural  language.  Whether  the  particular  language  spoken  by  an  in¬ 
dividual  restricts  or  otherwise  channels  his  thoughts  -  as,  e.g. 
color  limitations  of  certain  African  tribes  or  expansion  of  subtlety 
as  in  German  philosophy  or  Eskimo  descriptions  of  snow  conditions  - 
is  still  a  matter  of  controversy.  (This  goes  back  to  the  famous 
Whorf-Saplr  hypothesis  concerning  natural  languages  influence  upon 
thinking  and  the  Berlin-Kay  "universalistic"  antithesis  [271], 
[153].)  These  concepts  include  vague  ideas  such  as  "love",  "hate", 
"much",  "some",  "above",  as  well  as  more  arithmetic  concepts  "if  two 
convex  bodies  in  Hilbert  space  are  disjoint,  then  a  separating 

fb 

hyperplane  may  be  found",  "  Ja  cos  (x)  dx  *  sin  (b)  -sin  (a)  ",  etc. 

See  Allerton  [3]  and  Dingwall  [47]  for  analytic  surveys  of  natural 
languages.  See  Lenneberg  [155]  for  a  discussion  of  the  biological 
basis  of  language.  Leech  [153]  proposes  that  language  can  be  rough¬ 
ly  divided  into  three  areas:  pragmatics,  syntactics  and  semantics. 

The  seminal  paper  of  Lakoff  [150* ]  on  linguistics  and  natural 
logic  is  completely  in  accord  with  the  spirit  of  the  approach  taken 
in  this  text:  to  model  how  humans  empirically  perceive  truth  through 
natural  language  descriptions  and  associated  reasoning. 

Formal  language  in  its  most  general  form  can  be  described  as  a 
collection  of  (1)  primitive  symbols,  (2)  syntactic  rules  for  combin- 
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ing  symbols,  and  (3)  compounds  of  strings  of  symbols  produced  by  the 
recursive  application  of  (2)  to  (1)  and  itself.  An  interpretation 
of  formal  language  always  exists,  symbol  by  symbol,  within  some 
"natural"  language.  Conversely,  it  is  assumed  that  any  natural 
language,  given  enough  time  and  patience,  can  be  fully  symbolized 
within  a  formal  language  setting.  Indeed,  the  thrust  of  modern 
formal  linguistics  as  carried  out  by  Chomsky  [35],  Lyons  [162], 
Allerton  [3]  and  others  is  the  development  of  formal  language  rules 
of  syntax  which  reflect  ana  explain  the  nature  of  the  natural 
language  in  question,  as  opposed  to  the  earlier  structuralistic- 
behavioristic-ciassif icatory  approach  to  language.  (See  "On  Noam 
Chomsky:  "Critical  Essays",  pp.  2-32,  by  Searle  [227]  for  further 
discussions  of  the  two  schools  of  thought  as  well  as  the  entire  book 
for  summaries  and  implications  of  Chomsky's  work.)  Basically, 
Chomsky  emphasises  syntactic  modeling  via  (1)  phrase  structure  rules 
which  carry  the  "deep  structure"  and  determine  therefore  the  seman¬ 
tic  content  and  (2)  transformational  rules  which  act  upon  (1), 
changing  forms  but  not  semantic  content,  such  as  inversion  of  words, 
use  of  synonyms,  and  change  of  voice  from  active  to  passive.  Even 
the  above  description  is  subject  to  many  exceptions  and  changes. 
(See  Dingwall  [47],  Allerton  [3]  for  surveys  and  controversies.) 

It  is  necessary  to  keep  in  mind  that  we  are  interested  in  a 
mathematically  sound  form  of  semantics  rather  than  the  "formal 
linguistic"  approach  as  mentioned  above.  Thus,  syntactic  structures 
that  will  be  developed  will  correspond  approximately  to  formal 
linguistic  phrase  structure  rules  rather  than  to  transformational 
rules . 


Parsing  principle 

In  applications  we  assume  a  mapping  exists  between  any  given 
natural  language  and  its  formal  language  representation. 

For  further  details  on  the  Parsing  Principle  and  other  related 
procedures  Interpreting  natural  language  in  terms  of  formal  lan¬ 
guage,  see  section  2.6  and  the  appendix  at  the  end  of  Chapter  10. 
For  an  excellent  survey  of  cognitive  processes  and  natural  language 
see,  e.g.,  WInograd  [274'].  See  also  Gregg  [98']  for  a  compendium 
of  papers  on  knowledge  and  cognition.  For  an  extensive  presentation 
of  semiotics  (the  study  of  symbols  used  in  language  and  codes),  see 
Eco  [53']. 


2.2.2  Basic  structure  of  a  formal  language 

We  present  here  the  basic  structure  for  a  formal  language. 
The  metalanguage  and  metalogic  of  the  following  study  is  ordinary 
English  within  the  two-valued  classical  predicate  calculus  setting. 
(See  the  remarks  in  the  Introduction  (Chapter  1).) 

A  many-sorted  (possibly  higher  order)  formal  language  t  con¬ 
sists  of  a  syntax  part  Lsyn  '  P^3  a  possible  additional  theory 

Th„(£)  and  the  results  of  that  theory.  In  addition,  L  operates 

upon  its  alphabet  or  collection  of  symbols,  syrob(i),  including 

parentheses  (  )  ,  letters  a ,  p ,  y,  ...,  a,  b,  c,  ...,  i,  j,  k,  .... 

braces  (  }  ,  formal  deduction  (-  ,  and  specially  designated  symbols 

such  as  0  ,  1  ,  n  ,  V  ,  etc.  From  now  on,  we  will  use  Symb(i)  , 
a 

but  not  consider  it  as  a  formal  entity.  (Note,  however,  the  car- 
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dinality  of  Symb{4)  Is  needed,  e.g.,  for  the  Lowenheim-Skolem 
property.  See  2.3.3  (a)  (xii),) 

*syn  contains,  firstly,  a  category  of  types  of  things, 

Cor e(t)  .  (See  the  appendices  at  the  end  of  2.4  for  definitions  and 
properties  of  categories  and  deduction  categories.)  The  objects  of 
category  Core(£)  are  the  actual  types,  including  the  subclass  of 
sorts,  consist  ing  of  0b(£)  ,  an  index,  referring  formally  to  rele¬ 
vant  universes  of  discourse,  Ar(z)  ,  an  index,  referring  to  arrows 
(generalizations  of  functions)  between  these  universes  of  discourse, 
and  an  index  Rel(z)  ,  referring  to  relations  upon  the  universes  of 
discourse.  The  arrows  of  category  Cor e ( X )  are  the  function 
symbols  occurring  between  the  types,  each  such  symbol  formally 
representing  a  function  from  one  type  to  another.  (Two  given  types 
may  have  an  infinity  of  arrows  between  them.)  The  collection  of  all 
such  arrows  or  function  symbols  for  Core(z)  contains  several  im¬ 
portant  subclasses,  including:  Cat(£)  ,  the  class  of  all  function 
symbols  representing  category  (and  deduction  category)  theory  con¬ 
cepts,  such  as  arrow  domain,  codomain,  composition,  projection, 
identity,  product,  substitution,  signature,  used  for  sets,  functions 
and  relations  (as  later  interpreted  in  the  semantic  evaluations  in 
section  2.3);  Foun(z)  ,  the  class  of  all  function  symbols  repre¬ 
senting  set  theory  concepts,  including  membership,  equality,  class 
abstraction;  Loc(z)  ,  the  class  of  all  functional  symbols  represent¬ 
ing  logical  connectors,  including  T  ,  l  ,  not  ,  &  ,  or  ,  implica¬ 
tion,  etc.;  and  Quan(i)  ,  the  class  of  all  function(al)  symbols 
representing  quantifications  such  as  universal  quantification  and 
existential  quantification,  as  well  as  including  possibly  various 
degrees  of  partial  quantification  ("much",  "many",  "about  3/4", 
etc .  )  . 

Secondly,  *syn  contains  a  theory  of  syntax  Thsyn***  con¬ 
sisting  of  axioms  and  rules  described  through  sequents  (or  certain 
ordered  pairs  involving  predicate  symbols)  which  produce  the  class 
of  deducts  or  theorems  De(Th(Zsyn))  '  ^rom  which,  in  turn,  one 

obtains  ?ar (l)  . 

Var (*)  is  a  deduction  category  representing  all  basic  varia¬ 
bles  for  t  .  The  object  part  of  \?ar [l)  ,  0b(9ar(z))  consists  of 
the  collection  of  all  formal  universes  of  discourse  of  interest, 


i,  j,  k,  ...;  the  arrow  part  of  9ar(i)  ,  Ar(9ar(£))  ,  contains  all 

arrows  (or  generalized  functions)  between  the  formal  universes  of 
discourse,  (f  :  1-*  J),  (g  :  k  -*  j),  (h  :  j  x  j  -*  k  )  ,  :  k  -►  i), 

including  the  subclass  Wfv(X)  of  all  projections  here  or  individ¬ 
ual  variables  such  as  x*1*  =  (proj^(kxi)  :  k  x  i  -*  i)  , 

( k ) 

y  =  (proj^(kxix  j )  :  kxixj  -•  k)  ,  etc.,  and  the  bigger  subclass  of 
all  individual  terms  Wft(i)  which  is  defined  as  the  composition  of 


all  arrows  of  ^ar(i)  with  all  elements 
where  well-defined  (by  use  of  Thsyn(£))  , 


g(y (k* )  =  g°y(k) 


h(f (x(1) ) 


g(y(lc) ) ) 


(i.e.,  arrows)  of  Wfv(£)  , 
such  as  f (x(1) )  -  fox*1*  , 

ho ( <f°x* *  * , goy * k * > )  ,  etc. 


The  relation  part  of  7ar {l)  ,  Rel(Var(£))  consists  of  all 
relation  symbols  (formally  representing  relations)  upon  the  formal 
universes  of  interest,  Tj,  s^,  ...,  rjx^<  r^,  Sj,  tfc,  ...  .  This 
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includes  the  subclass  Wfat(t)  ,  the  atomic  well-formed  formulas, 
which  is  defined  as  the  class  resulting  from  the  substitution 
operation  (where  well-defined)  of  individual  terms  into  relation 

symbols  such  as  s^[x^*^],  r^^[<f  .  etc.  In 

turn,  Wfat(r)  is  included  in  a  generally  bigger  class,  Wff(t)  , 

of  Rel(^ar(£))  ,  the  class  of  all  well-defined  formulas  of  z  , 

generated  or  spanned  recursively,  by  applying  Loc(t)  and  Quan(i) 
(where  well-defined)  to  Wfat(i)  . 

We  define  the  class  of  all  basic  well-formed  terms  of  z  as 

Wfter(t)  =  Ar($ar(t))  U  Rel($ar (Z))  ,  which  contains  the  class  of 

all  well-formed  expressions  of  z  ,  Wfex(t)  3  wff(t)  u  Wft(t)  . 

Connected  with  Wfex(t)  are  the  total,  free,  and  bound  indi¬ 
vidual  variable  maps  which  identify  which  individual  variables  occur 
in  a  given  well-formed  expression,  which  are  bound  or  controlled  by 
quantifiers,  and  which  are  free  of  control. 

i'hv.(£)  ,  analagous  to  Th  [t)  ,  consists  of  axioms  and  rules 

iv  syn 

involving  sequents,  but  in  this  case,  the  sequents  are  all  of  the 
form  (y,  9 )  ,  where  y,fl  e  Wff  {£)  and  b  is  the  fixed  predi¬ 

cate  symbol  representing  formal  or  deductive  implication  and  hence 
(y  b  e)  may  be  interpreted  as  "  f  deductively  implies  a"  ,  etc. 

The  chief  consequence  of  ThK(i)  is  De(ThR(x.)),  the  class  of  all 

sequents  here  obtained  by  (recursively)  applying  the  rules  to  the 
axioms.  Each  such  (y,  bL  a)  e  De(ThK(£))  may  be  considered  a 

theorem  of  ThK(£)  with  hypothesis  y  and  conclusion  a  .  Depend¬ 
ing  on  the  choice  of  ThR(£)  ,  De(ThK(i))  will  reflect  the  various 

relationships  between  wff's  relative  to  FuncSymb(Z)  .  The  choice 
of  ThK(4)  -  as  presented  later  in  this  part  -  usually  emphasizes 

Foun(-t)  ,  Loc(z)  ,  and  Quan(Z)  -  as  opposed  to  Th_„_  (z)  which 

syno 

essentially  refers  only  to  Cat(i)  .  Although,  a  specific  theory 
will  be  chosen  for  Th^(4)  here,  it  is  obvious  that  any  other 

theory,  say,  ThK,(z)  could  be  appended  in  place  of  ThR{£)  and 

developed  analogously.  Indeed,  it  is  in  this  sense  that  we  define  a 
formal  language  as  used  throughout  the  remainder  of  the  text.  When¬ 
ever  Thj,(X)  is  actually  used  in  t  ,  this  will  be  so  indicated 

(and  called  Th^(z. )  )  . 


I .  Basic  syntax  of  z 

The  basic  syntax  LSyn  of  4  may  be  described  as  consisting 

of  the  core  of  z  ,  Core(t)  ,  the  syntax  theory  Th  (z)  ,  the 

syno 

variable  class  ?ar(i)  ,  the  well-formed  term  class  Wftero(r)  ,  the 

well-formed  expression  class  Wf exo (i)  ,  and  the  free  and  bound 

individual  variable  maps  FVq(z)  ,  BnVo(£)  ,  respectively. 

Specifically,  Core(i)  =  ( Ob j ( z ) , Arr ( z ) )  is  a  category  of 

types,  so  that  we  may  identify  Typ(£)  =  Obj(x)  ,  the  object  class 
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of  Core(A)  and  FuncSymb(A)  =  Arr (A)  ,  the  arrow  or  functional 
symbol  class  of  Core(A)  .  (For  a  review  of  category  theory  con¬ 
cepts,  see  appendices  at  the  end  of  2.4.)  A  is  a  higher  order 
formal  language,  in  general  here,  because  Sort (A)  ,  the  class  of 
basic  types  or  kinds  of  things  we  are  concerned  with,  is  more  than 
one  in  number.  Here  Sort (A)  =  ( 0b( A ) , Ar ( A ) , Rel ( A ) )  ,  where:  Ob(A) 
refers  to  objects  which  later  (in  the  semantic  evaluation)  may  be 
interpreted  as  generalized  sets  or  members  of  such  (generalized) 
sets;  Ar(A)  refers  to  things  which  later  may  be  interpreted  as 
functions  or  arrows  between  the  objects  or  generalized  sets;  Rel(A) 
refers  to  things  which  may  be  interpreted  as  relations  in,  or  on, 
the  generalized  sets.  In  turn,  Typ(A)  ,  the  class  of  all  types, 
can  be  considered  as  the  class  of  all  possible  compound  or  mixtures 
of  sorts,  relative  to  the  primitive  operation  ®  ,  cartesian 
product,  and  possibly  additionally  @  ,  exponentiation.  Thus,  we 
can  write 


Typ(A)  =  span( Sort ( A ) , {« ,  @  })  . 

By  the  symbol  span(A,Q)  ,  we  mean:  A  is  a  given  collection 
of  entities  and  Q  =  is  a  collection  of  operators 

or  more  generally,  procedures,  with  appropriate  mixed  iterates 

CL  •••Q  well-defined  ,  («,,.,,«)  £  (1,2,.,.)  where  in  particu- 
1  n  in 

lar,  we  note  Q  n  =  Q  •••Q  and  Q°  »  Identity,  such  that 
u  u  u  w 

(1)  Qu  • • • Qu  oSpan(A,Q)  £  span(A,Q)  ;  for  all 
1  n 


(2)  span(A,Q)  is  the  smallest  such  collection  in  (l)  so  con¬ 
structed  . 

Another  name  for  this  procedure  is  the  free  algebra  generated 
by  Q  over  A  or  the  strings  generated  by  Q  over  A  .  (See 
Enderton  [54],  Chapter  1  for  further  elaborations  and  connections 
with  homomorphic  extensions,  etc.) 

Back  to  types:  A  typical  example  of  a  type  here  is 

Ob ( A )  x  Ob ( A )  x  nRel(A)xAr(A)  ,  etc 

(Note  also,  the  symbols  x  and  n  as  used  in  a  later  part  of  the 
development  of  a  will  have  a  different  interpretation.) 

The  arrows  or  functional  symbols  making  up  Arr (A)  are  of  the 
typical  form  (remember  that  Core(A)  is  assumed  to  be  a  legitimate 
category!)  ( F  :  a  -*  p)  or  equivalently  F  e  Arra  ^(Core(A))  ,  for 

any  a,p  e  Typ ( A )  .  Thus,  composition  and  identity  are  assumed  well- 
defined  here  so  that,  if  a,p,t  e  Typ ( A )  and  F  €  Arr  (Core(A))  , 

(X  ,  p 

G  €  Arr  ( Core ( A ) )  ,  then  G«F  «  Arr  (Core(A))  and 

H  *  T  Q  t  • 

Id^oF  =  F  =  F°Ida  ,  etc.  (Again,  note  that  abuse  of  notation  takes 

place  here,  since  °  ,  Id  are  used  later  for  a  different  meaning.) 

Arr (A)  contains  a  number  of  specially  designated  function 
symbols ,  i . e . , 
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where : 


Cat  (i. )  2 


Foun(i) 2 


Loc(*)  2 


Quan (£ )  2 


Arr(l)  2  Cat (£ )  U  Foun(Z)  U  Loc(t)  U  Quan(i), 


S  =  (S  : 

Ar(£)  ObU)) 

domain  map 

B  =  (B  : 

Ar(£)  -♦  Ob(i ) ) 

codomain  map 

o  =  (° 

Ar(£)XAr(£)  -»  Ar(i))  - 

composition  map 

Id  =  (Id  : 

Ob(i)  -♦  Ar(i)) 

identity  map 

proj  *  (proj  : 

0b(r)X0b(£)  Ar (/.))  - 

projection  map 

a  =  (o  : 

•[••]  =  (•[••)  : 

Rel  (L )  Ob(Z)) 

signature  map 

Rel(£)*Ar(£)  -  Rel(£)) - 

substitution  map 

x  =  (x  ; 

Ob(Z)n  -*  0b(£ ) ) 

(cartesian  object) 
product  map 

t  =  (t  : 

Ob(i)n  -  0b(£) ) 

(cartesian  object) 
sum  or  coproduct  map 

<•>  =  (<_>  : 

0b(£)  -•  Ar(A)xAr(£) ) 

special  arrow 
product  map 

'  n  =  (n 

0b(*)  -  Ob (l)) 

-  exponent  map 

€  =  (€ 

*  =  (* 

Ob (t)  -♦  Rel(£) ) 

-  membership  map 

Ob(i)  Rel(i)) 

-  equality  map 

Ex  ■=  ( Ex 

Ob(i)  -  Rel (/.) ) 

-  existence  map 

{•!••)  =  ({•!•} 

Ob(A)xOb(t)xRel(A)  Ar (£))  -  class 

■ 

abstraction  map 

T  =  (T  : 

0b(£ )  -  Rel(Jt) ) 

-  truth  map 

1  =  (1  : 

ob(i)  Rel(i) ) 

-  false  map 

nt  ■  (nt  : 

Rel(i)  -  Rel(£)) 

-  negative  map 

&  =  (&  : 

Rel(*r  Rel(t) ) 

-  conjunction  map 

or  *  (or  : 

Rel(Jt)n  -»  Rel{£) ) 

-  disjunction  map 

=»  =  (»  : 

Rel(i)  )xRel(i)  -♦  Rel  (£■ ) ) 

-  implication  map 

id  =  (Id  : 

Rel(4)  -  Rel(A) ) 

-  identity  relation 
map 

maybe  =  (maybe  : 

Rel (i)  -  Rel (t) ) 

-  doubt  relation 
map 

poss  =  (poss  : 

Rel (£)  -«  Rel(Z)) 

-  possibility  map 

V  =  (  V 

Ob(i  )xOb(Z  )xRel  (l )  -»  Rel  (i))  -  universal 

quantification 

3  =  l  3  : 

Ob(t)xob(X.)xRel  (t)  -•  Rel  (I.))  -  existence 

quantification 

Mo  =  (Mo  : 

Ob(t  )x0b(£)xRel  (t )  Rel(I))-  "most"  map 

Abt3/4  =(Abt3/4  : 

Ob(i  JxOb(£)xRel  (f. )  -»  Rel(f.))-  "about  3/4”  map 
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Note  that  we  can  classify  the  logical  connective  functional 
symbols  as 

00 

Loc ( t  )  =  U  Loc  (£)  , 

n=l 

where  Locn(x)  represents  the  n-ary  logical  connectors  of  interest. 
Hence , 

Loc^(t)  2  (nt,  id,  ...} 

LoCj  (•£)  2  {  =»  ,  «=*,  &,  or,  ...)  , 

etc  . 

Note  that  in  the  case  of  &  and  or  ,  we  may  well  wish  to  ex¬ 
tend  their  domains  to  include  any  number  of  arguments.  (One  way  of 
doing  this  is  to  formally  assume  sy.umetry  and  associativity,  intro¬ 
duced  by  adding  the  appropriate  axioms  and  rules.  (See  the  later 
development  for  t  in  III  and  VI  for  basic  theorems  connected  with 
theory  Th^(t)  ,  demonstrating  symmetry  and  associativity  of  opera¬ 
tors  &(=a  here)  and  or(=v  here).} 

In  order  to  complete  the  syntax  of  L  we  need  the  concept  of  a 
theory  for  t  .  In  addition,  we  will  also  need  this  concept  when  we 
consider  logical,  set  foundations  and  quantification  properties  for 
t  . 

First,  define  the  (Gentzen)  sequent  class. 

Let  Predsym(b)  be  a  collection  of  predicate  symbols.  Then 
Seq^ ( l )  =  A(t)  x  Predsym(b)  x  a(l) 

where  A(t)  is  some  class  of  elements,  sc  that  typically,  if 
v,8  «  Term  ( L )  ,  [r  L  0  )  «  Se^Term(t)^i)  ’  0i  if'  further< 
r,0  e  Var (Ar (b  )  )  ,  (*  =ArU)»)  «  Se<?Var  ( Ar  (i  )  j  ( *’ >  '  etc‘ 

Then  any  axiom  collection  Ax(t)  for  L  aid  x(i)  satisfies 

0  »*  Ax(£)  £  Seq  (i) 

AU) 

and  any  rule  collection  Rul(i)  for  L  and  *{t)  satisfies 

CO 

0  *  Rul  ( L )  £  Seq(oo)  (L)  =  u  Seq (X ) x . . . xSeq ( i )  . 

x{t)  n=l  > - „ - - 

n  factors 

A  corresponding  theory  Th(X)  is  4  hen  defined  to  consist  of 
Th(t)  3  (a x(x) ,  Ru 1(b))  , 

with  the  class  of  deducts  or  theorems  De(Th(X))  produced  by  Rul(x) 
acting  upon  (or  more  formally,  as  a  relation  or  fcnction  product 
upon)  Ax ( t )  .  Thus,  we  write 

De(Th(X))  =  span(Th(x))  £  Seq  ( L )  . 

A{t) 
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In  general,  each  element  R  e  Ru 1(f)  is  in  the  form 


If  1*"  1®  1  >  '  •••'  <Vn9J  th*n 


n  n  n' 


yd) 
n+l  n.+  l  n+1 ' 


where  "if"  and  "then"  are  in  the  usual  0-1  metalogic  context 

U)  ■ 

A(L) 

Furthermore,  we  may  be  interested  only  in  some  subclass  of 
deducts 


and  ‘¥VUV  6  Seq 


Ser  (i.)  s  De(Th(*)  )  . 

A(L) 

For  any  (?i0 )  e  Seq  (t)  ,  we  call  r  the  hypothesis  and 

A.(L) 

0  the  conclusion  or  inference. 

Next,  the  basic  variables  and  constants  of  L  will  be  con¬ 
sidered.  This  is  provided  by  a  fixed  basic  variable  mapping 
compatible  with  x  and  fi 

Var  :  TypU)  -  Ob(SET)  , 

where  SET  =  (Ob(SET),  Ar ( SET ) )  is  the  category  of  all  ordinary  sets 
and  functions  between  them.  (Again,  see  Appendix  1  at  the  end  of 
2.4)  ,  so  that  in  particular  the  basic  variables  are 

Var(ObU))  2  (i,  ij,  i2 . j,  J1#  j2 . k,  ...} 

individual  object  symbols 

Var(Ar(i ) )  2  (f,  fj,  f2<  ...,  g,  ...,  h,  ...} 

individual  arrow  or  function  symbols 


Var ( Rel ( £ ) )  2  (r,  Tj ,  r2, 


s ,  s 


1' 


t ,  u. 


,  W, 


. ) 


individual  relation  symbols 
Var  ( Ob  (x. )  xob  (/. )  )  =  Var(0b(x.)  )xVar(0b(i  )  )  , 

Var(Ar(f )xnRel(Z) )  =  Var  Ar ( i ) ) xfiVar ( ReI ] {L ] )  , 


etc.,  where  the  right-hand  side  symbols  x  ,  fl  are  ordinary 

cartesian  product  and  exponentiation. 

Note  also  that  generally,  basic  variables  include  basic 
constants,  denoted  by  Cons ( • )  ,  so  that 

Var ( Ob ( £ ) )  2  Cons ( Ob ( 4 ) ) 


where  1^,  1^,  ...  represent  individual  constants  or  terminal  ob¬ 

jects  in  category  theory  notation  and  0  represents  the  null  object 
or  null  set  or  the  initial  object  (in  category  theory  notation). 

Other  constants  (or  semi-constants,  such  as  terminal  arrows 

f  .  :  i  -»  1  )  or  initial  arrows  (g  .  :  0  ->  j )  can  be  desig- 

l  ,  a  a  0 ,  j 

nated  as  needed  in  Cons(Ar(£))  £  Var(Ar(£))  ,  etc.  (Again,  abuse 

of  rotation  here  involves  1  and  0  , )  le  define  the  class  of  all 

basic  variables  as 
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Var(Typ(£))  =  U  Var(a) 

aeTyp(£) 

and  the  ordered  pair 

Var  (£ )  =  (Var(Ob {t)),  Var(Ar(£)))  . 

Next,  we  consider  the  class  of  basic  terms,  Term(£)  ,  possibly 
quantified  formulas  Form{£)  ,  atomic  formulas.  Atom (£)  ,  individ¬ 
ual  terms,  Indterm(i)  ,  and  individual  variables,  Indvar(l)  ,  and 
individual  constants,  Indconst(A)  . 

We  first  form  the  formal  spans  generating  successively,  these 
classes.  But  in  order  to  restrict  the  classes  to  meaningful  quan¬ 
tities,  we  must  add  the  first-level  syntax  theory 

ThI,syn(i)  =  (AxI,syn(£)'  RuII,syn(i))  9ivin9  Var(£)  a  category 
structure.  (See  later  remarks.) 

In  turn,  we  also  add  the  second-level  syntax  theory 
ThII,syn(£)  =  (AxII,syn(i) '  RulII , syn{t) >  '  to  completely  specify 

the  well-formed  terms  Wftero(£)  and  well-formed  expressions 
Wfexo(£)  . 

Thus,  we  have  formally, 

Term(t)  =  span( Var ( Typ ( L ) ) ,  Arr(l)) 

2  Form(i)  =  span  ( Form  *  (£)  ,  Quan(£)) 

2  Form 1  ( £ )  -  span( Atom(£ ) ,  Loc(Jt)) 

2  Atom(£)  ^  Var(Rel (£) ) [Indterm(£ ) ] 

^  (■ [• • ])(Var(Rel(£) ) ,  Indtermf  £))  . 

Also , 


Term(£)  2  Indterra(£)  =  span( Indvar (£ ) ,  Var(Ar(£))) 

2  Indvar(x)  3  pro j ( Var ( 0b(£ ) )xVar ( 0b( l ) ) ) 

2  Indconst(i)  , 

with  the  class  of  predicate  symbols 

PredsyraU)  =  <=ob(£).  =Ar(r)>  =r el(£)'  h £  '  £  '  €  '  •••) 

used  in  the  theories  ThIsyn(r)  .  ThII(SynU)  . 

Some  selected  first-level  syntax  axioms  and  rules  from  the  18 
given  in  Appendix  2  on  Deduction  Categories  and  Formal  Topoi  (at  the 
end  of  2.4): 

For  all  i  e  Var (Ob (£) )  , 

RI ,  syn,  1 ;  i  :  (S<^(i))  -obB(Id(i))  -ot)i  ) 


For  all  k  «  Var(Ar(£)  , 


4,syn,2:k:  ( ko ( id ( S ( k) ) )  -ApId(B(k) )«k  -Apk) 
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) 


and  ( S ( f )  =obB(g)),  ( S ( g)  =obB(h))  . 

Rr  f  :  For  all  f,g,h  e  Var(Ar  {£)) 

I , syn, 4 ;  f  ,  g ,  h 

then 


( ( f°g)°h  =Arfo (g®h) )  . 
For  all  i,j  €  Var(Ob(£))  , 

^1 , syn ,  8  ;  i  ,  j  :  <B(proJ1(1) (IxJ, , 

(B(proj2 (  }  ( ixj) ) 


Ob 

Ob 


i)  . 

j)  • 


I , syn ,  17 ; f , a ' 


For  all 


1  e  Cons ( 0b( t ) ) ,  all 

3 


f  e  Var ( Ar ( t ) ) 


{ B  ( f )  =ob  la)  then  (f  =Ar  pa(S(f)))  (pa  a  terminal  arrow 

map)  . 


Remarks  concerning  individual  variables. 

Recall 

Indvar(f)  -  pro j ( Var ( Ob ( t ) )  x  Var(Ob(£))) 

=  u  Indvar i ( t )  . 

i€Var(Ob(f)) 

If  1,  j  e  Var(ObU)  )  , 

S  proj  (  (  ix  j  )xi  )  projjj*^  •  Indvar  ±  ( X. )  , 

Xixj  ~  proj  (  (  ix  j  )x  j  )  £  proj***^  €  indvar^U)  , 

etc . 

We  usually  drop  the  subscript  I  x  j  from  the  individual 
variables . 

Note  also  the  distinction,  e.g., 
x<*>  =  pro j3 ( kxix ix jx i ) 


and 


y^'  -  proj2  (kxixix  jxi  )  . 

Also,  note  the  basic  identification  (which  can  be  produced  as  a 
rule  or  theorem)  for  all  i,j  e  Var(Ob{£))  , 

projl(i)(i)  =  Id(i)  :  Id(lxJ)  =  <proji(ixi),  pro j  .  ( ix j ) J > 

=  <x(  ^  ,  x  ^  >  ,  etc . 
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Finally,  note  the  individual  constant  maps: 

Indconst(X)  =  u  Ar^  1  (Var(x))  , 

fj€Var(Ob(X) )?  ) 

I l_eCons(Ob(x) ) [ 

where  category  Var(X)  is  described  later. 

Analogously,  s econd-level  syntax  axioms  and  rules  may  be  ob¬ 
tained  as  abstracts  of  the  corresponding  ones  in  Appendix  2  at  the 
end  of  2.4.  In  the  following  development,  all  quantities  may  be 
interpreted  as  ordinary  sets  and  elements  and  all  predicate  symbols 
such  as  may  be  correspondingly  interpreted  as  the 

usual  set  relations  with  only  a  minor  abuse  of 

notation. 


RII,syn,l 


(Var (Typ(x ) )  s  Term(x)  =  u  Term  (x)  . 

aeTyp(X )“ 


For  all  a  ,  p  ,  F  ,  g  : 

RII,syn,2,  :  If  (a,/3  €  TYP(*))  and  (9  e  Var  ( X )  )  , 
a 

(  (F  :  a  -*  p)  €  Arr(X)  ) 

then 

(F(g)  e  Terra^ (X ) )  . 

For  all  i  , 

RII,syn,3;i  ••  If  e  Var(ObU)))  , 

then;  (Indvar^X)  £  Indterm^X})  . 


For  all  i, j,x(1) ,  f  , 


II , syn, 4 , 
1, j.x*1* ,f 


[i 

then 

where 

Note,  using 
f luous . 


If  ( I ,  j  «  Var  (Ob  (i )  )  )  ,  (x(1)  «  Indvar^x))  , 

(f  e  Var(Ar(X)})  ,  and 
( S ( f )  =  i)  ,  { B ( f )  =  j)  , 
e. ,  f  =  (f  :  i  -  j)]  . 

(f (x(i) )  e  Indterm^(X))  , 
f (x(i) )  =  fox!i)  . 

f  =  (Id(i)  :  i  -*  i)  ,  Rtt _  „  becomes  super- 

I I , syn , 3 


then 
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({xixj  I  r)  6  IndtermQ(1) (i) )  . 

Again,  see  Appendix  2,  end  of  2.4  for  all  corresponding  deduc¬ 
tion  rules  and  axioms  concretized  for  deduction  categories  and 
formal  topoi. 

We  summarize  the  axioms  »nd  rules  previously  introduced  and 
connect  them  with  Term(£)  in  order  to  produce  the  class  of  well- 
formed  expressions: 


“l.syn'11  - 


(Ax 


I  ,syn 


U> 


^I.synt*)) 


where,  because  axioms  can  usually  be  formed  as  deduction  rules  and 
vice-versa,  we  simply  abuse  notation  and  let 


A*I.syn(A>  U 


RulI,syn<*> 


a 

s 


R-r  ,  U  .  . 

I ,  syn ,  1 

SeqTerm(z) 


U  RI,syn,17 
"  Term<x)  *  <=ob'  =Ar ' 


=Rei>  x  Term(X)  . 


II , syn' 


AxII,syn(i)  U  RulII,syn(i)  RII,syn,l  U  *”  U  RII,syn,ll 

U  RI,Th,0  U  RI , Th, 1  U  RI , Th, 2 
£  Seq^  ^  ^  ^  )  —  A  j  j  ( )  x  —  }  x  Aj^(t-)  , 


where 


A__  (t )  »  (Term(t),  Term  (l)  ,  Term  (£),  Indvar.fl), 

II  a  p  i 

Indterm^(t),  Atom(t),  Form'(t),  Form(£),...}  , 


and  where ,  e . g . , 
R 


T  ,  -  {Rt  ,  ,  I  all  1} 

I,syn,l  I, syn ,1,1 


?  —  /  R 

II, syn, 11  '  II, syn, 11; 

1  x(1)  7 

3'  (ixj)  ,Y 


|  aU 


We  combine  the  two  theories  together  yielding 

Th  (i)  =  (Ax  (L) ,  Rul  (I))  , 
syn  '  syn  syn 


Ax 


syn 


( t )  u  Rul  (t) 
'  syn  '  ' 


etc . 
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RI,syn,l  U 

•  .  , 

U  RI,syn,17 

u  R, , 

.  u 

••■UR,, 

II , syn. 

1 

II , syn, 1 1 

U  RI,Th,0 

u  •  • 

U  RI ,Th, 2 

S  SeqTerm{£) 

U) 

U  SeqAIl(£)(i)  • 

In  turn,  now  form  the  class  of  deducts 

De(Th  (£))  =  span{Th  (£))  . 
syno  syno 

Next,  project  out  the  first  component  in  all  sequents  of  the 
form  (ree)  and  intersect  these  with  the  appropriate  subclass  of 

Terra(£)  to  obtain  the  well-formed  subclasses  of  Term(t)  : 

Wfter  o(t)  -  pro  j  (•€••)  (De(Th  (/.)))  , 

*  © 

the  class  of  all  well-formed  terms  (in,  Term(Jt))  ,  which,  by  use  of 

Thsyn  '  raay  be  identified  with  Var(Ar(£))  U  Var(Rel(£))  , 

Wffo(i)  ^  Wfter o(t)  n  Form(Z)  , 
the  class  of  all  well-formed  formulas  (wff's)  ; 

Wfato(£)  =  Wftero(£)  n  Atom(t)  , 
the  class  of  all  well-formed  atomic  formulas; 

Wf  to ( t )  =  Wftero(£)  n  Indterm(i)  , 
the  class  of  all  well-formed  individual  terms, 

Wf vo(x.)  =  Wftero(£)  n  lndvar(£)  , 
the  class  of  all  well-formed  individual  variables  , 

Wfco  U )  =  Wftero(£)  n  Indconst(i)  , 
the  class  of  all  well-formed  individual  constants. 

Then  define  the  well-formed  expressions  of  L  as: 

Wf ex  (£)  3  wff  (£)  u  Wft,  .(£) 

©  o'7  (  o  ) 

=  WfteroU) 

=  Var(Ar{£))  u  Var(Rel(£))  , 

since  we  can  use  Rj  gyn  14  to  identify  any  r  «  Var(R el(t))  with 
r £ Id(o (r ) ) ]  €  Wffo(t)  ,  noting  that  Id{o(r))  e  Wf vo(£)  ,  and  where 

compatible  with  Thsvn  U)  ,  we  can  identify  the  originally 


28 


Goodman  and  Nguyen 


apparently  smaller  class  Var(Rel(x))  with  Wffo(x)  ! 


IhSY»  <*> 


with  the  appropriate  spans  generating  Term(X)  , 


Form (X),  Atom(x),  Indterm(x)  ,  etc.,  implies  Var(X)  is  a  category 

and  $ar(X)  =  (Var(x),  x  (X))  is  a  deduction  category,  where  k(x) 
is  specified  below.  (See  the  appendices  at  the  end  of  section  2.4 
for  definitions  and  properties  of  deduction  categories  and  formal 
topoi . ) 

First, 


Var(x)  =  (Ob(Var(x)),  Ar(Var(x)))  is  a  category,  where 


Ob(Var (X ) )  §  Var (Ob(x ) )  =  { i , . . . , j , . . . , k, . . . ) 


Ar (Var (i ) )  =  Var(Ar(z))  =  {f , . . . ,g, . . . ,h, . . . }  , 


where  we  note  that  (because  of  Th  (x)  )  we  may  identify 

* 1 o 

f  =  (f  ;  i  _  j)  ,  for  i,j  e  Var(Ob (x))  ,  with  f  e  Ar.  .(Var(X)) 

•1  »  J 

and  the  identify  arrow  is  Id(i)  *  (Id(i)  :  i  -♦  i)  (having  the 
usual  category  properties)  and  composition  here  (is  derived  from 
o  :  Ar  (X. )  x  Ar (X )  Ar(X)  as  a  function  symbol)  is  the  usual 

category  composition:  (f  :  i  -*  j)  ,  (g  :  j  -*  k)  ,  yields 
(g°f  :  i  -«  k)  ,  etc. 

In  addition,  x(x)  :  Var(x)  Preord  is  a  contravariant  functor 
given  as  follows: 

For  all  i  «  Var ( Ob ( x ) )  , 

x(x)(i)  =  (r  |  r  «  Rel(X)  and  o(r)  =  i)  . 

We  write 


x(t) ( i )  =  { r ^ ,  t  ^ ,  •••)  • 


The  predicate  symbol  is  an  appropriate  partial  order  on 

each  K(X)(i)  by  use  of  axioms  RITh0  -  RI(Th,2  and  RIsynl5 

(see  Appendix  2,  at  the  end  of  2.4).  In  turn,  we  can  extend  this 
order  in  a  natural  way,  for  any  r^,  s^  e  x(X)(i)  : 

Define  (r..  L  si)  iff  (ri  \~L  Sj)  e  De(Thsyn  b(x))  .  for 

b , 

the  appropriate  case  of  b  (b  =  0  ,  here,  later  extended  to  b  =  K) 

.  For  any  i,j  €  Var(Ob(X))  ,  and  any  (f  :  i  -*  j)  (i.e.,  any 
f  e  Ari ^ ^ ( Var (X ) ) )  , 

K(X)(f)  €  ArK(£)(j)f  3C  ( x  )  )  (  i)(Preord) 

where  for  any  s^  «  X(X)(j)  , 

*(i)(f)(Sj)  S  Sj[f]  , 

and  since  o(s.[f])  =  i  (see  axiom  R  .  of  Appendix  2)  , 

j  x  t syn / x  x 
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*U)(f)(Sj)  e  x(t)(  i)  . 

Note  also  K(i)  is  order  preserving  w.r.t.  h  by  use  of 
RI,Th,0  ’ 

Finally,  the  basic  syntax  of  L  will  be  completed  with  the 
addition  of  the  free  and  bound  individual  variable  mappings. 

The  free  individual  variable,  total  individual  variable  and 
bound  individual  variable  maps  are  given  by,  respectively, 

FVoU),  T Vo(t),  Bn VoU)  :  Wf exo U )  -  *(Wf vo(/.)) 
given  by  BnVo  U  )  =  TVo(t)  H  FVq  U )  ,  where 

(i)  For  all  x(l)  «  Wfv^f)  ,  i  e  Var(ObU))  , 

TVo(x(l))  =  FVo{x(l))  ^ 


and  hence 

BnVo  (x*1*  )  =  0 

(ii)  For  all  i,j  e  Var(ObU)),  (f  :  i  -♦  j)  e  Var(ArU)), 
t(1)  «  Wft  o{L)  ,  r  =  r1  (i  *  a  ( r) )  e  Var  (Rel  (je.  )  )  , 

TVo(f(t(1)))  «  FVo(f(t(1))}) 

-  TVo(ri[t(i,l)  *  FVo(r1[t{i)]) 

=  TVo ( t ( 1 } )  =  FVq ( t ( 1 } )  . 

Hence , 

BnVo(f[t(1)])  =  BnVo(ri[t(1)])  =  BnVo(t(i))  =  0  . 

The  class  of  all  formal  sentences  of  l  is 

SentoU)  3  {y  |  y  .=  Wfexo{£)  and  FVo(y)  =  0} 

=  {y  |  y  e  Wff oU)  and  FVo(y)  =  0} 

where  the  class  of  all  open  expressions  of  L  is 

OpenoU)  3  {y  |  y  €  Wfexo(i.)  and  BnVo(y)  =  0} 

2  Wft^U)  u  Wf  ato  ( L  )  . 

The  sequent  class  for  Wffo(£)  relative  to  the  special 
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predicate  symbol  t-  can  now  be  formed.  This  class  is  the  basis  for 
investigating  all  additional  theories: 

SeqWff  ll)U)  3  *  <H}  *  Wffo^> 

© 

=  {(?*-«)  |?,e€  wff 0[t) )  . 

In  summary,  the  basic  syntax  of  t  has  been  established  and 
this  point  -  we  may  express  L  as 

isvn  =  (Core(i),  Th  U),  $ar (t)  :  Wfexo(t),  FVoU),  BnVe(/.))  , 
with 

SeroU)  3  SeamfJt)U)  "  De  ( Thsyn  J  A  ) ) 
as  the  class  of  deducts  or  theorems  of  interest  relating  to  . 


II.  Expanded  syntax  of  L 

In  general,  we  may  wish  to  add  more  structure  (or  in  fact  take 
away  structure)  to  L  in  the  form  of  an  additional  theory,  say 
ThK(£)  .  However,  this  theory  may  well  entail  further  operations 

and  constraints  on  them.  Hence,  i  must  be  expanded  appropri- 

'  Q 

ately  in  order  to  serve  as  a  basis  for  the  extended  language. 

Specifically,  let  ThR(Z)  be  a  theory  concerning  logic,  set 

foundations  and  quantifications,  which  will  be  treated  in  the  next 
section.  Define 


AX 


syn. 


(1)  U  Rul 


K 


syn. 


U) 


K 


=  Ax 


syn 
u  R. 


(t)  u  Rul 


syn 


U) 


o 

U  R 


1, syn, 36  ' 
has  a  concrete  counterpart  in  Appen- 


I , syn , 1 8 

where  as  before  each  Rj  syn  u 
dix  2  at  the  end  of  2.4. 

Then  analogous  to  the  development  in  (I),  we  obtain  for  the 
expanded  syntax  theory  Th  „  (£)  =  (Ax^,,,  (i)  ,  Rule„„U)) 


syn. 


syn 


K 


syn' 


De  ( Th 


syn 


U)  ) 


K 


In  particular,  the  free  variable  total  variable  and 


bound  variable  maps  now  extend  to: 

For  any  nil,/,  ri’’'',rn  €  Wff0(£) 
q  e  Quan(Z),  f  s  Loc n(^)  >  I  €  Var(0b(i))  , 


x(i)  «  Wf vi(l)  , 
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n 

FVR(-r 

,,y  ))  .  u  fvk(?u 

U>  =  1 

n 

TVK(T('l,‘  •  • 

,,y  ))  =  U  TV 

W  =  1 

FV„( (x(1) 

1  *))  =  FVv(q  x(1) 

=  fvk(¥-)  -t  fvk(x(1)) 

BVk({xU)  I  IP})  =  BVK(qx(1)  )  (*)  ) 

=  (X(1))  U  BVK(1P)  . 

TVk({x(1)  |y}}  =  TVk(qx(1)  (?)  )  =  TVk(lP)  U  FVk(x(l))  . 

We  abuse  notation  somewhat: 

{x(ix^}  =  (x(i>.  x<*>).  etc. 

Thus,  we  now  can  let  the  formal  language  be 


with 

and 


(CoreU),  Th  (t),  Sfav(t)  .  WfexvU),  F V  U),  B V  (A)) 
synK  k  k  k 


SeqWffK(t)<^  “  WffK(i)  X  {Kt)XWffK(i)  ' 


Ser^(t)  =  Seqwf  (t)  n  De(Th  (A)) 


Note  that  it  is  assumed,  without  loss  of  generality,  that 
all  pertinent  symbols,  including  function  symbols  constituting 
FuncSymb(Z)  (and  its  subclass  of  distinguished  function  symbols)  and 
9ar ( L  )  ,  are  given  in  the  beginning  of  the  construction  of  l  ,  al¬ 
though  relations  between  them  and  their  compounds  and  substitutive 

forms  may  or  may  not  be  present  through  Th  (£)  or  even 

syno 

Th  {l)  .  In  any  case,  not  due  to  any  structural  changes,  but 

ynK 

rather  to  the  class  of  well-formed  expressions  apparently  shrinking, 
because  of  further  identifications  of  forms  through  the  added  theory 
ThK(t)  ,  we  employ  the  subscript  K  in  Wfex^Ji)  ,  WffK(i) 

Wfat V{L)  ,  Wf v (l )  ,  etc.  However,  again  abusing  notation,  we 

k.  \  K ) 

assume  that  we  can  identify  all  compound  wff’s  with  appropriate 
elements  in  Var(Rel(t))  and  all  such  wft’s  with  these  in 
Var(ArU))  . 


Added  theory  for  t 
and  quantifications. 


for  It 


set  foundations 


Define  the  added  theory  ThK(£)  =  (A xK(Z),  RulK(£))  by 
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AXKU,  U  Ru1k(Z)  =  RlfThf3  u 


U  RI,Th,24  S  SegWffK(£) {L)  ’ 


where  again  each  Rj  Th  u  has  an  obvious  concrete  counterpart  given 

in  Appendix  2  at  the  end  of  section  2.4. 

We  then  define  the  full  expanded  theory  for  L  as 

Thsy„K.K<i>  '  <**sy„K.K<i>  '  Rulsy„K.K< ' > >  ' 

where 


Ax 


synR ,  K 


(l)  U  Rul 


synK ,  K 


(l )  =  Ax  (*  )  u  Rul  (l ) 


syn. 


syn. 


K 


U  Axv(t)  U  Rul  ( £ )  =  R_  ,  U  • 

K'  synK  I , syn , 1 

u  Rtt  ,  u 

II , syn, 1 


u  R 


I , syn , 36 

U  Rtt 

II ,syn, 11 


U  RTh,0  U 


u  R„ 


Th ,  24 

Hence,  finally,  we  define  the  full  expanded  formal  language  as 


‘sy„,.K  -  (Core(i)  .  Ih  KV).  V.rU).  FVKCO.  BV^It)) 

K  K 

with  the  general  deduct  class  and  deduct-of-interest  class  given  as 
De(ThsynK.K<t)>  =  sPan{ThsynK,K(A)) 


and 


SerR(i)  = 


Seq 


wffRU 


(i)  n  De ( Th 


synK,K 


U>> 


With  this  additional  structure  (summed  up  by  Th_„_  „(/.)) 

syn^  ;  fw 

the  deduction  category  \7ar(£)  becomes  now  a  formal  topos.  This  is 
a  basic  algebraic  structure  for  semantic  evaluation.  Furthermore, 
this  structure  can  always  be  Imbedded  in  a  natural  way  into  a  topos 
which  plays  a  key  role  in  set  theory.  (See  section  2.4.) 


IV .  Additional  remarks 


We  may  wish  to  consider  additional  concepts  for  l  .  For 
example,  we  may  wish  to  add,  e.g.,  the  concept  of  double  implication 
«=»  or  alternate  denial  or  other  quantifications  or  logical  con¬ 
nectives,  without  defining  them  as  primitive  concepts  (such  as, 
elements  of  Arr(i)  ,  here)  which  are  described  by  additional  axioms 

and  rules  added  to  Th  V(L)  .  This  can  be  done  by  defining 

syn^ ,  k. 

these  operations  eliminatively  through  those  already  established 
through  i  .  For  example,  («=»  :  Rel(i)  x  Rel(£)  -♦  Rel(X.))  is  de¬ 
fined  most  naturally  as 
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=  M=> 


=►  < 

( composition) 


which  through  the  category  theory  structure  of  Coreft )  is  a  legi- 
tamate  member  of  Arr(t)  ,  as  are  &  and  »  .  (See  for  example, 
Coste  [41],  p.  10,  where  T  ,  F  ,  =»  ,  e  ,  v  ,  3  can  all  be  defined 
eliminatively  through  only  v(=or)  ,  a(=&)  ,  *  ,  n  ,  {•)••)  ,  •[••]  , 
a  .)  Note  also  that,  compatible  with  the  intuitionistic-like  struc¬ 
ture  of  Th  -(Z)  »  we  can  define  negation  nt  directly  through 


a  and  X  formally  as  nt (•)  =  (•)=»  l  (•)  .  See  also  the  Godel- 
Intuitionistic  Logic  in  section  (C)  ,  2.3,2  and  the  remarks  on 

Fuz(H),  etc.  in  section  2.4.  (See  also  Maydole  [172],  p.  13  for 
other  examples  of  eliminative  definitions.) 

An  example  illustrating  the  form  of  a  well-formed  formula  and 
well-formed  terms: 

Let  :  i,j,k  «  Var ( 0b(Z  )  )  :  ( f  :  i  -  j )  ,  (g  :  j  x  j  -  k)  , 

(h  :  j  -•  j)  «  Var(Ar(X )  )  ;  x^  €  Wfv^L),  x^  ,  z^  e  Wfv^U), 

z(k)  €  Wfvk(£)  ,  y(1)  €  Wfvi(i)  ;  rk,  tk,  wk  e  Var(Rel(z))  with 
°(rk)  ■  °(tk)  =  °^wk^  =  k  ’  and  consider  nt  ,  &  ,  and  =»  c  Loc(z)  . 


Assume  throughout  that  S(x^i^)  =  S(x^)  =  S(z^)  =  s(z'kM  = 

Sfy^1*)  =  l  ^  i  x  j  x  j  x  k  x  i  ,  and  hence,  x^1*  =  x^1*  = 

projl(i)(<)  '  xU)  =  X<(J)  =  proj2(j){<)  '  etc* 


foX(1)  =  f(x(1>),  hofoy(1)  =  h(f(y(i)))  e  Wft^(x)  , 
g(<f(x(1)),  h(f(y(1)))>)  €  Wftk(x)  , 
rk[g(<f (x( ) ,  h(f (y( ) )>) ]  e  Wfat(z)  , 


with  signature  i 


sk[g(<z<^,  x^>)]  e  Wf  at  ( l )  , 


with  signature  l, 

wk[z(k)]  e  Wfat ( t ) 
with  signature  l  .  Then,  in  turn, 

y  -  ( (rktg(<f (x(i) ) ,  h( f (y(i) ) )>) ]  &  nt ( SjJg ( <z ( J 5 ,  x(j)>)]))  » 

( k> 

wk[z  ])  €  Wff (Z)  .  Indeed,  7  e  Open(Z)  ,  with  a (7 )  =  l  . 

Then  note  that 


9 


-  v  (  i  ,  j*  jxkxi  ,7  )  3  (Vx^*)(?) 
=  (vx(1))  ?[<x(1),  x(^,  z(^ 


=  (vx(i))y  [Id^] 
z(k) ,  y(i)>]  e  wff (X) 
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with 


FV { 9 )  =  {x  ^  i  z  ,  z  »  y  }  * 

Also,  note  that 

*  =  v  (  j  ,  ixkx  i  ,  3  (  j , ix  jxkx  i  ,  nt(sk[g(z^  ,  x^)]))) 

=  (vx(  j)  )  (3z(  )  (nt(sk[g(z(j>  ,  x(;i))]))  e  Sent  (t)  . 

Also,  note  the  use  for  o(s)  =  i  x  j  , 
w(i>  2  pro  j  i  (kx  ix  j  )  ,  2  pro  j  ^  (kx  ix  j  )  ,  =  projk(kxixj)  , 

A  2  (vk) ( ix  j ,  r[proj .  .  (kxix j ) ] ) 

+  *  J 

=  (vw(k))(r[<w(1),  w(j)>HIdkxixj]) 

=  (Vw(k)  )  (r[w(i)  ,  w(j)>][<w{k)  ,w(l) 

2  (vw^’llrlw'11,  w<J>,  w<k>))  , 


etc . 

(See  Appendix  2  at  the  end  of  2.4  for  further  details  concern¬ 
ing  the  properties  of  V  ,  3  ,  a  ,  e  ,•[••],{•  |  ••}  ,  etc . ) 

V.  Summary  of  structure  of  t 

A  formal  language  may  be  developed  in  two  stages,  stage  0  the 
syntactic  level  and  stage  K,  the  expanded  level  (or  more  generally, 
for  b  s  K) . 

Let  subscript  b  =  0  or  K  ,  with  b  =  0  referring  to  the 
basic  level  of  development  of  t  and  b  =  K  referring  to  the  addi¬ 
tional  level  of  development  Then,  denoting 

i  =  i 

syn  , 0  syn 

O  w 

Th  .  2  Th 

syno , 0  svn0 

Ax  _  2  Ax  , 

syn  , 0  syn 


Rul  „  2  Rul 

syno  ,  0  syno 


we  have : 
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tsvn  b  =  (CoreU)'  Thsyn  ,b(t'"  Var(t)'  WfexbU)'  FVb{i)'  BnVbU))' 
*  b '  b 

Sort (4 )  =  { Ob ( 4 ) ,  Ar ( 4 ) ,  Rel(4)}  , 

Core ( 4 )  =  (0b(Core(4 ) ) ,  Ar(Core(4)),  the  type-category, 

Ob j (4 )  =  0b(Core(4))  =  Typ(4)  =  span ( Sort ( 4 ) ,  x,  n)  , 

Arr(4)  =  Ar(Core{4))  =  FuncSym(4) 

2  Cat(4)  u  Foun(4)  u  Loc(4)  u  Quan(4)  . 

(See  part  I  for  a  listing  of  special  designated  function  symbols.) 
Var  :  Typ(4  )  Ob(SET)  , 


V ar(4)  =  u  Var(a)  ,  where  in  particular 

ae  Typ ( 4 ) 


Var ( Ob ( 4  )  )  = 

{  i  ,  .  . 

.  ,  j,  . 

.  .  ,k,  .  . 

•  > 

Var ( Ar ' 4 ) )  = 

(f ,  .  . 

.  ,g,  ■ 

.  .  ,h,  .  . 

.  )  with  f  =  (f  ;  i-*  j) 

,  etc 

Var ( Rel ( 4 ) )  = 

<r,  . 

■  .  ,S, 

.  .  .  ,  t ,  . 

.  . }  with  r  =  r .  ,  etc . 

Var(4)  =  (0b(Var(4)),  Ar(Var(4)))  ,  the  variable  object-arrow 

category, 

Ob  ( Vai  i  )  )  =  Var  (Ob  (4  )  )  ,' 

Ar(Var(4))  =  Var(Ar(4))  , 

?ar(4)  =  (Var (4),  M(4))  the  variable-deduction  category, 

K(4)  :  Var(4)  -*  Preord,  the  variable-relation  contravarlant  functor. 

For  all  i,j  €  Var(0b(4))  ,  all  ( f  :  i  -»  j )  e  Var(Ar(4))  , 

3C  ( 4  )  (  i )  =  (r1  |  ri  €  Var  ( Rel  ( 4  )  )  ,  ofr^  =  i)  , 

«(4)(f)  e  ArxU)(j)xU)(i)  (Preord)  , 


with  for  all  r^  €  a(4)(j)  ,, 

K(4)  (f  )  (Tj)  =  r  j  [  f  ]  e  3C(  4  )  (  i )  , 

Predsymb ( 4 )  =  (=Qb,  =ftr ,  =Rel ,  S,  ^ ,  ...} 


In  a  sense,  span  refers  to  suitably  well-defined  operations 
mined  by  Th^^)  : 

Ter.n(4)  2  Wfterb(4)  =  span(Var(4),  FuncSym(4)) 


=  Var(Ar(4))  u  Var(Rel(4)) 
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=  wf ex  ) 


synb, b 


n  b  ' 

*U>>  - 


SeqWff b(i)U)  =  wffb{i)  x  {^}  *  wffb(i)  • 

A  word  of  caution:  Unfortunately,  the  word  "sentence"  is  used 
universally  in  the  literature  of  formal  linguistics  and  the  analysis 
of  natural  language  to  refer  to  any  (compound)  form,  obeying  natural 
language  grammar/syntactics.  By  the  Parsing  Principle,  this  cor¬ 
responds  to  some  wfex  (well-formed)  expression  in  the  formal 
language.  However,  the  latter  may  or  may  not  be  a  formal  sentence. 
In  addition,  such  wfex’s  usually  have  specific  labels  for  their 

individual  variables,  such  as,  x  ^  =  "John"  and  y^n  *  =  "happy", 
etc.,  as  in  the  natural  language  sentence  "John  is  happy,  but  Sam 
likes  to  read".  These  labels  indicate  which  individual  variable 
assignment  map  values  are  to  be  given  to  the  free  variables,  as  part 
of  the  semantic  evaluation  of  the  wfex.  (See  2.3.2.) 


VI.  Basic  theorems  of  Th  „(£) 

- syncs’  L- 

In  the  following,  a,b,c,d,...  e  »(i)  for  some  i  e  Ob(C)  , 
(c,a)  a  deduction  category  with  all  additional  properties  as 
required.  Equality  "  =  "  really  means  "  =  M.  "  ;  "  £  "  means 


l , Th , a 


*(i) 

,  a  =  0,1, 


(ii)  a  s  b  iff  a  a  b  =  a  iff  a  v  b 
(iii)  a  =  b  iff  aAb=avb=a=b 
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Proof:  (i)  :  R!  '  R4  '  R5  '  R9  '  Rjo  ' 

(ii)  :  If  a  £  b  ,  since  a  £  a  by  R1  ,  then  a  S  a  a  b  , 

by  R-  .  But  a  a  b  s  a  by  (i)  and  hence  a  =  a  a  b  .  If 

O 

a  —  a  a  b  ,  then  since  a  a  b  £  b  ,  by  ( i )  ,  R2  implies  a  s  b  .  A 
similar  proof  holds  for  the  remainder. 

Theorem  2  Symmetry 

a  a  b  =  b  a  a  . 

Proof:  From  Theorem  1  (i),  and  Rg  ,  a  a  b  £  b  a  a  and  by  arbitra¬ 
riness  of  a  and  b  ,  b  a  a  <  a  a  b  and  hence,  a  a  b  =  b  a  a  . 

Theorem  3  Idempotence 

aAa  =  a  =  ava  . 

Proof :  By  Theorem  1  (i)  ,  for  b  =  a,aAaia.  But  since 

a  s  a  ,  a  ,  by  Rg  ,  a  s  a  a  a  .  Similarly,  use  Rg  to  obtain 

a  =  a  v  a  ,  using  Theorem  4. 

Theorem  4  Marginal  properties 

aAT  =  TAa  =  a 
avT  =  Tva  =  T 
a  a  i  =1  a  a  =  1 

a  v  i  =  x  v  a  =  a 

Proof:  Use  Rg  ,  R?  ,  etc. 

Theorem  5  Associativity 

a  a  (b  a  c)  =  (a  a  b)  a  c 

a  v  (b  v  c)  =  (a  v  b)  v  c 

Proof:  By  Theorem  1  and  R4  ,  a  a  b  £  a  ,  (aAb)  a  c  s  a,  c  . 

Then  Rg  implies  (aAb)AcSaAC.  Similarly, 

(aAb)  AcSbAC  .  Hence  by  Theorem  3, 

(a  a  b)  a  c  =  (a  a  b)  a  c)  a  (a  a  b)  a  c)  5  (a  a  c)  a  (b  a  c)  <  a  a 
(b  a  c)  .  By  arbitrariness  of  a,b,c,  we  can  reverse  inequality 
with  a  and  c  interchanged. 
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(iii)  Use  part  (ii),  applying 

(iv)  Note  that  (iii)  above  applied  to  Theorem  1  (i)  yields 
nt(a  v  a)  £  nt(a)  ,  nt(b)  s  nt(a  a  b)  , 
and  hence  by  R_  and  RQ 

D  O 

nt(avb)  ^  nt(a)  a  nt(b)  , 
nt(aAb)  £  nt(a)  v  nt(b)  . 

Finally,  note  that  by  part  (ii)  above,  assuming  distributivity 
(see  Theorem  8) 

nt(a)  a  nt(b)  a  (avb) 

=  (nt(a)  a  nt(b)  a  a)  v  (nt(a)  a  nt(b)  a  b) 

3  (nt(a)  Aa)  v  (nt(b)  Ab)  =1  vi  =i  . 

Hence,  applying  R12  : 

nt(a)  a  nt(b)  a  (avb)  s  i 
Hence , 

nt(a)  a  nt(b)  s  nt(avb)  . 

Similarly  by  considering  the  above  proof  for  a  a  b  replacing 
avb,  R j  2  again  yields  the  desired  result. 

Note:  Since  a  and  v  are  associative  and  symmetric,  they 

can  be  extended  recursively  in  the  obvious  way,  unambiguously,  to  an 
arbitrary  finite  number  of  arguments,  and  in  turn,  formally  to  any 
number  of  arguments. 

(v)  Consider  T  =  n t(l)  =  nt(a  a  nt(a))  =  nt  a  v  nt(nt(a))  , 
a=aAT=aA  (nt(a)  v  nt(nt(a)) 

=  (a  a  nt(a)  v  (a  a  nt(nt(a)))  =  a  a  nt(nt(a))_ 

Theorem  6  Non-decreasing 

If  a  s  b  ,  then  a  a  c  s  b  a  c  ,  avcsbvc  . 

Proof :  By  Theorems  2,  5,  3, 

(a  a  c)  a  (b  a  c)  =  (a  a  b)  a  c  =  a  a  c  . 

Then  by  Theorem  l(ii),  a  a  c  s  b  a  c  . 
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Theorem  7  Negation  properties 


Define  nt(a)  =  a  »  i  . 
Then 


(i) 

(ii) 

(iii) 

(iv) 

(v) 


nt (T)  =  l  ,  nt<l )  =  T  . 


nt(a)  a  a  =  l  (Law  of  Excluded  Middle). 
Hence,  if  a  s  b  , 


nt(b)  a  a 

If  a  S  b 

nt(a  v  b) 
nt(a  a  b) 


i  . 

then  nt(b)  s  nt(a)  (Nonincreasing). 

nt(a)  a  nt(b)  s  nt(a) 

nt(a)  v  nt(b)  (DeMorgan's  Laws) 


nt (nt (a) )  *  a 
nt ( a)  v  a  *  T 


in  general, 
in  general, 


but : 


nt(a)  v  nt(nt(a))  =  T  , 
as  nt ( nt (a) )  . 


Proof :  (i)  Use  R^  ,  etc. 

(ii)  Use  RX1  ,  noting  b  S  b  ,  l  s  l  and  hence 

b  a  (b  »  i)  s  X  ;  then  use,  e.g.,  Theorem  6.  Then  by  Theorem  1 
(ii) ,  as  nt ( nt (a ) )  . 


Theorem  8  Distributivitv 

a  a  (b  v  c)  =  (a  a  b)  v  (a  a  c)  , 
a  v  (b  a  c)  =  (a  v  b)  a  (a  v  c)  . 

( Proof :  Since  (a  a  b)  s  (a  a  b)  v  (a  a  c)  and  (a  a  c)  s  (a  a  b) 
v  (a  a  c)  ,  Rg  Implies  aA  (bvc)  s  (aAb)  v  (sac)  .  The  con¬ 
verse  procedure  Is  easily  verified.) 

Theorem  9 


(1) 

a  a  (a  a  b) 

s  b  s 

(a  a 

b)  . 

(ii) 

If  a  2  b  , 

(a  =» 

b)  = 

T  . 

( Proof : 

Use  R11  and 

R12 

•  ) 
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Remarks ■ 


Thus ,  under 


ThsynK,K(i> 


an  intuitionistic-like  theory,  it 


follows  that  operators  a  and  v  will  have  the  properties  of 
general  t-norms  and  t-conorms,  respectively,  and  nt  is  a  general 
negation  (see  section  2.3.6)  with  the  added  structures  of  idempo- 
tency,  DeMorgan  and  distributivity  -  all  properties  that  ordinary 
minimum  and  maximum  have. 

It  may  be  desirable  to  replace  Th  ..(£)  —  perhaps  by  a 

Sylijr  t  8 

less  restricted  theory— where  a  is  replaced  by  &  being  a  t-norm, 
v  by  or  ,  being  a  t-conorm,  nt  (and  hence  =>  )  by  a  general  ne¬ 
gation  (and  hence  implication)  operator.  However,  in  this  case,  the 
formal  topos  structure  will  no  longer  be  valid. 


2.3  Semantic  evaluations. 


2.3.1  Preamble. 

Two  schools  of  thought  arise  regarding  the  semantic  or  truth 
evaluation  of  a  formal  language.  The  platonistic  approach  inter¬ 
prets  the  semantic  evaluation  as  a  statement  about  the  real  world. 
On  the  other  hand,  the  formalistic  approach  allows  no  such  inter¬ 
pretation  . 

Despite  the  setbacks  in  platonism  (•’he  advent  of  non-Euclidean 
geometry,  discovery  of  paradoxes  in  naive  set  theory,  Godel's  un¬ 
certainty  theorem,  Heisenberg's  uncertainty  principle,  etc.),  the 
philosophy  has  a  great  deal  of  appeal.  This  is  because  of  the 
continuing  egocentric  viewpoint  that  Man  has  with  the  rest  of  the 
universe.  Hence,  we  continue  to  persue  knowledge  and  "prove" 
theories  (within  possibly  inconsistent  systems)  and  attempt  to 
relate  these  concepts  to  our  surroundings. 


2.3.2  Basic  evaluations  of  formal  language. 

The  structure  of  a  formal  language  L  has  an  given  in  the 

previous  section  (2.2).  In  this  subsection,  we  present  semantic 

evaluations  of  l  .  These  evaluations  are  all  carried  out  within  a 

deduction  category  framework.  If  L  has  enough  structure,  (such  as 

in  the  form  i  „  see  2.2.2),  then  t  may  be  further  evalu- 

syn^,K 

ated  within  a  formal  topos.  Since  a  construction  can  be  carried  out 
showing  any  deduction  category  with  some  additional  structure  (in 
particular,  a  formal  topos)  can  be  imbedded  within  a  corresponding 
category  (or  topos)  (see  Appendix  2  at  the  end  of  section  2.4),  the 
above  semantic  evaluations  of  L  may  be  naturally  thought  of  as 
being  within  a  category  -  or  topos.  (Compatible  with  this,  we  can 
identify  any  ordinary  topos  c  as  a  formal  topos  (c,Sub).) 

Since  a  semantic  evaluation  of  L  may  be  thought  of  as  an 
interpretation  of  £  ,  there  are  (in  fact,  infinitely)  many  in¬ 
terpretations  of  £  ,  which,  although  certainly  valid  within  the 
very  general  and  encompassing  structure  of  a  deduction  category 
(including  essentially  all  systems  of  our  interest),  may  not  hold 
within  the  much  more  restrictive  form  of  a  formal  topos  (or  topos), 
even  if  £  has  sufficient  structure.  For  example,  in  a  given  situ¬ 
ation,  it  may  be  more  appropriate  to  interpret  the  logical  connector 
"  &  "  (in  Loc(£)),  not  as  a  minimum  operator  but  as  a  product  or 
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more  generally  as  a  t-norm  operator.  (See  section  2.3.6)  In  par¬ 
ticular,  (see  Appendix  2  at  the  end  of  2.4)  it  follows  that  rule 
RI  Th  6  vi°^ate<^>  in  general,  unless  a  is  a  minimum  type 

operator.  On  the  other  hand,  we  may  still  wish  to  retain  -  at  least 
formally  -  the  same  evaluation  procedure,  as  if  a  formal  topos  were 
present.  We  will  apply  this  principle  to  the  important  case  of 
general  logical  systems  (section  2.3.5),  the  cornerstone  of  the 
approach  taken  in  this  text  to  the  analysis  of  uncertainty,  and  in 
particular,  to  the  combination  of  evidence  problem  (see  Chapter  8). 

We  can  summarize  the  role  of  natural  language,  formal  language 
and  semantic  evaluations  in  the  analysis  of  uncertainty  as  follows: 

Natural  language  can  be  used  as  the  vehicle  to  express  cogni¬ 
tion  explicitly,  although,  often  somewhat  vaguely.  (But  keep  in 
mind  that  all  "rigorous,  mathematical”  concepts  are,  by  definition, 
expressable  in  natural  language.)  Through  the  Parsing  Principle, 
all  of  this  is  translated  into  a  formal  language.  Thus  the  problem 
of  uncertainty  or  combination  of  evidence  being  considered  concern¬ 
ing  an  unknown  parameter  -  such  as  "How  old  is  Charles?",  through 
all  the  information  supplied,  including  statistical  data,  linguistic 
information,  and  general  inference  rules  that  are  applicable  -  may 
be  modeled  at  first  within  a  formal  language  structure.  (See 
Chapters  8  and  9  for  further  details  on  how  such  problems  may  be 
treated.)  Then  a  class  of  appropriate  semantic  evaluations  must  be 
chosen  to  determine  the  truth  content  of  all  of  the  information  as  a 
whole,  and  in  particular,  that  for  which  projects  to  the  unknown 
parameter.  First,  we  choose  the  class  of  possible  semantic  evalu¬ 
ations  upon  the  relevant  formal  logical  connectors  Loc(i)  -  such  as 
nt  (not),  &  (and)  ,  or,  if  (  )  then  (  ),  etc.  -  and  relevant  for¬ 
mal  quantifiers  QuanU)  -  such  as  v  (for  all),  3  (there  is), 
Mo  (most),  etc.  This  is  the  logic  for  the  problem,  i.e,,  a  col¬ 
lection  of  semantic  evaluations  having  in  common  a  particular 
evaluation  upon  Loc(i)  u  Quan(A)  is  called  a  logic.  (See  2.3.3.) 
Next,  we  consider  also,  in  effect,  a  subclass  of  semantic  evalua¬ 
tions  upon  Foun(£)  ,  the  set  theory  connectors  e  -  membership, 
s*.  -  equivalence  of  sets,  etc.,  which  have  common  values  and  which 
are  compatible  with  the  logic  chosen.  (See  2.3.4.)  In  addition,  we 
consider  the  class  of  semantic  evaluations  upon  the  basic  "objects" 
involved,  yiej.uj.ng  the  universes  of  discourse  or  semantical  domains, 
such  as  the  set  of  all  men,  the  set  of  all  ordered  pairs  of  numbers, 
one  representing  heights  of  women  in  inches  and  the  other  weights  in 
lbs.  (See  the  development  in  this  section  for  details.) 

Also,  in  conjunction  with  the  above,  we  must  determine  the 
appropriate  class  of  semantic  evaluations  upon  other  basic  relations 
(corresponding  to  predicates  or  verbs  in  natural  language)  and  wff's 
(well-formed  )  formulas,  formed  as  compounds  (or  strings  or  the 
spanning)  of  basic  relations  together  with  the  substitution  of 
various  functions  (or  terms)  within  the  relations.  The  semantic 
evaluation  of  relations  yields  dispersions  or  membership  functions: 
in  general,  numerical-valued  functions  upon  their  signatures  which 
become  their  corresponding  universes  of  discourse  through  the 
semantic  evaluation.  If  any  universes  of  discourse  have  as 
elements,  "events"  or  equivalently  sets  of  more  primitive  elements  - 
such  as  in  the  case  of  probability  measures  and  Choquet  capacity- 
type  measures,  where  for  the  former,  a  given  universe  of  discourse  - 
corresponding  to  a  given  probability  measure  (there  may  be  many 
probability  measures  for  a  given  problem,  and  hence  many  universes 
of  discourse)  -  consists  of  o-algebra  of  subsets  of  a  given  set  or 
equivalently  a  Boolean  algebra  of  events,  then  we  call  the  corres¬ 
ponding  dispersions  uncertainty  measures  . 
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Finally,  we  evaluate  semantically  all  the  relevant  individual 
or  particularized  variables  from  the  universes  of  discourse  -  such 
as  "Charles",  "age  of"  and,  in  turn,  by  appropriate  substitution, 
obtain  the  desired  numerical  truth  values. 


I .  Semantic  evaluations  within  a  deduction  category. 

Let  (C,a)  be  a  given  deduction  category,  with  finite  products 
and  having  possibly  additional  properties  when  needed. 

Define  a  typical  partial  semantic  evaluation  (or  structure  or 
assignment)  map  from  Core(i)  U  Term(t))  into  SET  ,  denoted  sym¬ 
bolically  by  ii  ii.  .  :  (Core(i),  Term(t))  -♦  SET  where 

V  5  t  ”  / 

n  ii  :  Core(t)  -♦  SET  is  made  into  a  functor  between  categories 

Core(i)  and  SET  .  (Unless  stated  otherwise,  both  L  and  (c,a) 
are  fixed,  and  consequently,  we  will  drop  the  subscript  (c,a)  from 
the  semantic  evaluation  map  from  our  writing,  but  not  from  our 
thoughts!  Later,  another  subscript  w  will  be  used,  but  for  a 
different  purpose.) 

Consider  first  the  three  distinguished  objects  Ob(Jt)  ,  Ar { L )  , 
Rel(r)  «  Ob(Core(£))  : 

The  class  of  universes  of  discourse  or  semantical  domains  is: 
il Ob ( JC. ) n  =  HVar  (0b(£  )  )n 

=  {Hill  ,iii1»  ,  .  .  .  ,li  jli  ,  .  .  .  ,ukii  ,  .  .  . } 

where  typically  ii  ill  «  Ob(C)  ,  and  hence 
ii  il  :  Var(ObU))  -  Ob(c )  ,  with 

iiVar(ObU)  )li  =  (Il  ill  )  i  €  Var ( Ob ( X. ) )  >  £  Ob(c)  . 

Similarly,  the  class  of  arrows  between  the  universes  of 
discourse  is: 


n  ArU  )H  =  ii  Var  (Ar  (4  )  )n 

=  {il  fH  ....  ,  il  gll  ....  ,  ll hll  ,  ... ) 

£  Ar(C) 

and  the  class  of  relations  on  the  universes  of  discourse  is: 

li  Rel  (£  )  il  =  il  Var  ( Rel  ( t )  )ii 

=  {ll  rii  ....  ,  it  sii  ....  ,n  til  .... } 

£  Rel (c ,*)  ^  u  a ( a )  . 

a«Ob(c  ) 


Furthermore,  let 

ii  Sort  (i  )  ii  =  {  ii  Ob  { t )  ii  ,  |  Ar(i  )n  ,  it  Rel  (a  )  ii  )  . 
In  turn,  for  any  a  e  Typ(t)  ,  define 

nail  =  il  Var  (a)  il  =  {Hall  |  a  e  Var  (a))  . 
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ll  Typ  ( A )  ll  =  {Hall  |  a  s  Typ(A)} 

=  span{  H  Sort  (A )  ll  ;  x  ,  ft)  , 

where  x  and  fl  are  cartesian  product  and  exponentiation,  if 
needed,  for  c  , 

Thus,  e.g.,  Ob(A)  x  Rel ( A  )  e  Typ(A)  and 

iiOb(A)  x  Rel  (A  )  ll  =  ll  Ob  ( A  )  ll  x  ll  Rel  ( X. )  ll  ,  etc. 

Consider  next  the  semantic  evaluation  on  FuncSymb(A)  (the 
arrows  of  Core (A))  : 

For  any  a,p  €  Typ(A)  and  F  =  (F  :  a  -»  p )  e  Ara  ^(Core(A))  , 
choose  some 

IIFU  =  (HFII  :  Hail  -  H/JII  )  €  Ar„a„  ,  „  a,,  <  SET>  ' 

i.e.,  HFii  is  some  ordinary  function  from  the  set  Hall  to  the  set 
n/311  ,  where  for  any  a  e  Var(a)  and  hence  F(a)  e  Termp(t)  , 

nF(a)i:  =  iifii  (nan)  ,  noting  n  ail  e  nail  »  HVar(a)n  . 

Also,  for  any  (F  :  a  ■*  p ),  (G  :  p  ■*  i  )  e  Ar(Core(A))  ,  and 
id  :  a  -*  a  in  Ar(Core(A))  , 

CL 


IIGoFIl  =  ll  Gil  ll  on  iiFII  , 


11  ld^" 
a 


=  id 


Hall 


where  the  composition  and  identity,  ®  ,  id  ,  in  Core(A)  become  or¬ 
dinary  composition  lloii  and  ordinary  identity  id  for  SET  .  From 
now  on,  we  abuse  notation  and  let  il®n  =  ®  ,  etc. 

Obviously,  the  partial  semantic  evaluation  maps  of  most 
interest  are  those  which  are  compatible  with  Ths  (A) 

Specifically,  we  say  ll  ll  is  a  mode/  of  T^svn  (*)  >  iff  by  defini- 

*  © 


tion,  for  any  (V  i  s)  «  Axg  (4)  u  Rulcvn  (A)  ,  nil’ll  Hull  nan  holds, 

*  o 


where,  e.g.,  lull  is  ordinary  =  ,  for  if  «  <=0b'  =Ar'  =Rel^  and 

lien  is  ordinary  e  . 

It  follows  that  the  same  property  holds  for  all 

(T  i  a)  e  De(Th  (A))  .  (This  definition  of  a  model  extends  to 

syn 


any  given  theory  and  formal  language  in  the  obvious 


we  now  assume  that 


ll  ll 


I  s  a  a  ode  I  of 


Th 


syn 


way . 

U) 


within  de¬ 


duction  category  (C,a)  . 

Thus ,  we  have : 

For  any  f  «  Var(Ar(A))  ,  we  may  write  f  =  ( f  :  i  -*  j )  « 

Ari  ^(Var(A))  for  some  i,j  e  Var(Ob(A))  ,  and  hence  (from 

Thc’n  <*)>  -  S(f)  =  i  ,  B(f)  =  j  ,  where  S  =  (S  :  Ar  ( A )  Ob(A))  , 
syn^ 

B  =  (B  :  Ar { A )  -  Ob ( A ) )  .  Then 
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'If  =  («f »  :  Hill  -  lljll)  s  Ar|tln  (||  ju(C)  , 


where 


HSU  =  (HSH  :  HVar(Ar(x ) )n  ll  Var  ( Ob(X  )  )  n  )  , 

I'  Bit  =  (HBli  :  ii  Var  ( Ar  (X- )  )  il  -  ll  Var  ( Ob  (X  )  )  n  )  , 

Iildii  =  { II  Idii  :  II  Var  ( Ob  (x  )  }ii  il  Var  { Ar  (x  )  )  11  )  , 

il  pro  jn  =  ( il  pro  j  li  :  il  Var(Ob(x  )  )n  x  iiVar(Ob(x  )  )n  -*  ll  Var  ( Ob(X  )  )  il  ) 

are  the  source  (or  domain),  codomain,  identity-making,  and 
projection-making  maps  for  c  ,  respectively.  Thus, 

ll  S  (  f )  il  =  HSll  (Ilf  il  )  =  Hill  > 

It  B  (  f )  IT  =  HBll  (Ilf  ll  )  =  lljll  . 

For  any  i,j  s  Var(Ob(x))  , 

*  ll  Id  ( i )  ll  =  Id||lH  , 

an  identity  arrow  Id|lin  :  H  ill  u  in  of  c  ,  and  for  any 
f,g  e  Var (Ar (X ) )  with  S(f)  =  S(g)  ,  i.e., 

f  =  (f  :  i  -  j)  and  g  =  (g  :  i  -»  k)  , 

say,  then 

ii  pro  j  ^  (  i  ,  j  )  (fxg)ll 
=  ll  pro  j  1ll  (ll  ill  x  lljll  )  { ll  f  ll  x  llgii) 

=  nf  il  . 


Similarly,  it  follows  that  ll  it  is  compatible  with  all  axioms 
and  rules  (from  Thsyn  (£))  characterizing  the  functional  symbols 

S  ,  B  ,  o  ,  Id  ,  pro j  ,  x  ,  <  >  ,  a  ,  and  •[••]  . 

In  particular,  consider  again  ll  He  1  ( X  )  ll  : 

For  any  r  «  Var(Relfx))  ,  we  may  write  r  =  r  ,  .or  r  =  r 

o  ( r )  i 

i  =  o(r)  =  o(r1) 

Now,  the  function  symbol  o  =  (o  :  Rel(i)  -  Ob(X))  has  the 
evaluation 


lloil  :  ll  Var  (Rel  (X  ) )  ll  -*  HVar  (Ob(x  )  )ii 
the  signature  map  for  (c,»)  and 

H- [••]!!  :  ii  Var  ( Rel  (X  )  )ii  x  ll  Var  ( Ar  (X  )  )  il  -*  HVar  (Rel  (X  )  )ii  , 

the  substitution  map  for  (c,»)  . 

no  ( ri )  ll  =  il  in  =  ii o ll  ( ll  r^ii  )  . 
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For  all  (f  :  j  -«  i)  ,  (g  k  -+  j )  «  Var(Ar(£)>  ,  o(ri)  =  i  , 
o(rd[f])  -  j  . 

o,(rJ[f][g3)  =  o(r1[fog])  =  k  , 
o(Id(o(ri)))  =  o(Id(i))  =  i  , 

KjH  =  Ho  (r^[f  ]  )n  =  noil  (iiri[f  ]n  )  =  Hon  (a  (  n  f  n  )  ( n  r^i  ) )  , 

iikn  «  iio(r^[f][g])n  =  non  (it  ri[f°g]n )  =  Hon  (a(iifiioiigii  ) )  (nr  .11  ) )  , 

"i"  =  "0<Id(0(ri)))"  "  l,0,,<Id«o«(nr1»>)  =  llo,,(I<W  • 
noting 


nri[f  ]»  =  a (nf  11 )  ( 11  r ^  11 )  , 
a  (ii  fit  )  :  a  ( 11  ill  )  -*  a  ( 11  jn  )  . 

Clearly  in  establishing  the  functor  11  11  :  Core(i)  -►  SET  ,  we 

have  also  established  ll  11  as  the  functor  11  11  :  Var(t)  -»  c  . 

Next,  we  consider  the  predicate  symbol  J-  ,  representing 

(through  Ri,xh  0  '  fiI,Th  2  J  a  partial  order  over  Var(Rel(t))  . 
Define 


'"V 


SPreord  ' 


where,  for  any  i  e  Var(ObU))  ,  and  hence  Hill  e  Ob(c )  and 
a  ( 11  in  )  «  preord , 

(»V  I  »{ni»)  *  S*(lllll)  . 

But  from  the  properties  of  sPreord  here,  it  follows  that 
RI,Th,0  “  RI,Th  2  are  realized  compatibly: 

For  any  i ,  j  e  Var(Ob(X))  and  ,  si  ,  tjl  «  Var(Rel(i))  , 

with  o(ri)  *  o( Sj)  =  o(ti)  =  i  ,  and  (f  :  j  -  i)  €  Var(Ar(t))  , 

(i)  If  (ri  Sj^)  (abstractly)  then 
"ri"  sa(„i..)l,si"  ' 

"rjlfl"  =  i»  fti )  ( u  rdn  )  sa(|JJ|,  )H81[f]"  =  a(iifii)(iisiii)  , 
(ii)  |riu  sa(,til,  )l,rj"  - 
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ril  II  :  *U)  -1-  *o11  11  • 

since  for  all  i,j  e  Var(Ob(£))  ,  (f  :  j  -»  i)  «  Var(Ar(£))  ,  and 
hence 

II  f II  ;  njll  -*  lllll  , 
a  (Ilf  II )  :  a  (II  ill  a  (II  jit )  , 
and  therefore 

(a®  ii  H  )  { f )  <  r  „  ,|(i)(r1)) 

=  a  (ii  fn  )  (  ii  r  ^  H  ) 

-  nri[f]n 

-  (’■„  „  ( j)  )  (*(*>  (f )  )  • 

Next,  consider  H  H  over  Wfexo(£)  : 

Again  noting  that  (due  to  Th  (*■))  Wfexo(i)  £  Wftero(£)  , 

'  o 

the  evaluation  of  11  ll  over  Wfexo(i)  ,  and  hence  over  Wffo(£)  , 
Wfato(4)  ,  Wftjoj(t)  ,  WfVjoj(i)  and  WfCjoj(x.)  ,  become  a  special 
case  of  determining  n  11  over  Wftero(£)  . 

Thus : 

iiWfc(o)  U)H  =  { - »lan,  . . .  , 
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where  ll 1  a«i  ,  11  ,  ...  are  terminal  objects  in  c  ; 

HWfv{o  (4)n  =  . . y(J,il,  ,.x(k)l.,...>  . 

where  typically 

Y(i)  =  yixkxj  *  Pr°J j(ixkxj)  • 

x(k)  =  XW  =  projk(ixk)  «  Var(Ar(£) )  , 

lly^if  =  llpro  jll  (|  ( ll  ill  x  il  kilxil  jll )  e  Ar(c)  , 

iix*k*ii  *  liprojli|(ku  (H  iiixiikii )  e  Ar(C)  ,  ...  ; 

HWft(o)U>H  =  ( .  .  .  ,»f  (y(3) )n  .  Hg(h(z(k)),  x(i))ll,...>  , 

where  typically,  (f  :  j  -*  k)  ,  (h  :  k  -»  <)  ,  (g  :  4  x  i  -*  j)  e 
Var  ( Ar  (4 ) )  ,  y(i)  =  yj^j  «  WfVj(4)  ,  z(k)  =  z<J£  e  Wf  vfcU)  ,  x(i)  = 

Xixk  €  wfv1(i)  *  i,j,k,4  e  Var (Ob(i ) )  and 

nfn  «  Aru  jii  ,iikii  '  "h"  €  Arukii  ,Htii  *  '  l,gl1  €  Arii<iixii  in  t  ii  jii  (c  )  ’ 

,,f{y<j))"  *  ''£''0''Y{j)|'  €  Ar„iiixiikllxn  jll  ,lik)l  (C)  ' 

Hg(<h(z(k) )  ,  x<1>»« 

=  ..g..(<.lh..°i.z<k>..,  HX^IO)  e  Ar|(il|xllk(1>llj|1(C)  , 

«Wfato(i)ii  =  {...,iltjxk[g(<h(zlk)),  x(i)>)  x  f(y(j))]ii _ } 

where  typically,  g  ,  h  ,  z^k^  ,  x^1-^  ,  y^^  are  as  above  and 

tjxk  e  Var(RelU))  (°(tjxk)  *  j  *  k)  , 

l|tjxlc[g{<h(z(k))  ,  x(1)>)  x  f(y(J))]n 

=  a(iig(<h(z(k) )  ,  x(i)>)n  x  iif(y(J))»)  (Ht^^i)  «  Rel(c.a)  . 

This  completes  the  specification  of  il  ll  over  Wfex^fi)  , 
noting  that  because  only  ThSyn  U)  is  present,  n  n  over 

Wffo(z)  -i  WfatoU)  ,  in  general,  is  arbitrary  fixed  so  that  for  any 
r  «  Wffo(£)  i-  wf  ato  {l  )  ,  imp  ii  «  Rel(c,a)  with  11011(11*11)  =  iio(*)ii  , 
etc.,  as  is  compatible  with  ThSyn  (4)  only. 
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We  also  obtain 

II FV  il,  II  TV  ll,  UBnV  11  :  llWfex  (4)11  -*  V(UWfv  (4)H) 

n  a  o  a  '  n 


and  in  turn,  ll  Sent0  ( 4 )  u  ,  U0peno(4)ii  in  the  obvious  ways. 

Thus,  in  summary,  we  can  write  the  typical  partial  semantic 
evaluation  model  symbolically  as 


II  II  :  4 


syn 


SET  , 


where 


it  ll  :  Core(4)  -*  SET  is  a  functor, 

(ii  ii,  n  )  :  9a r(4)  (c,a)  is  an  arrow, 

if oi ll  =  llVar(a)ii  ,  a  c  Typ(4)  , 


iiVar(ArU)  )H  u  ll  Var  (Rel  (4  )  )  H  ■  Ar(C)  u  Rel(c,r,) 

=  iiWftero  (4  )it  =»  span  (ii  Var  (Typ(4  )  )H  ,  »'  FuncSym(4  )  ii  ) 

2  HWfex0  (4  )  ll  =  iiWff0(4)n  u  nWft(o  j  (4)11  , 

iiWfato(4)ii  =  span  (ii  Var  (Rel  (4  ) )»  x  iiWf  t{fa }  (4  )  n  ;  a  ( n  •  •  ll  )  { n  •  n  )  )  , 
li  Wf  t0  { 4  )  ll  =  span(  ii  Wfv^o  j  (4  )  ll  ;  ll  Var  ( Ar  (4 ) )  ll ) 

2  HWfVo(4)ll 

=  pro  j  (  ii  Var  ( Ob  (4  )  )  ll  x  n  Var  ( Ob  (4  )  )  li  ) 

2  pro j ( u Cons { Ob( 4 ) )n  x  ll  Cons  ( Ob(  4  )  )  ll  )  , 

"Thsyn  (-c)"  =  {(IMf’11  1,711  liell)  l  (1P  7  6  Thsyn  (x))  ' 

i)De(Th  (4))»  *  (  (inf'll  ii  v ii  u e ii  )  |  (y  t  e)  e  De(Th  (4 )  )  )  , 

syu 

etc . 

Note,  that  the  evaluation  of  any  y  e  Wfexo(4)  ,  depends  on  the 

evaluation  of  TVo(y)  -  which  is  a  collection  of  projections.  Each 

such  projection  becomes  a  projection  in  the  sense  of  category  c 
(under  the  evaluation),  in  a  natural  corresponding  sense.  However, 
this  is  not  enough  to  specify  an  actual  realization.  For  example, 
let  r^  «  Var(0b(4))  and 

x(i)  =  Xkxi(i)  =  proji(k  x  i)  «  Wfvi(4)  s  Var ( Ar (4 ) )  ,  yielding 
r  -  «  Wfato(4)  (with  signature  o(r^[x^^])  =  k  x  i)  . 
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Then,  as  we  have  seen,  inf’ll  =  a(iix^^n )  (Ur^il  )  ,  where 

il  x  ^ 1  ^  ti  =  proj|,  ^|(  (ukiixii  in  )  e  Ar(C)  and  nr^ii  e  Rel(c,»)  (with 

noil  ( ii  r^n  )  =  ii  111  ,  etc.).  However,  if  we  wish  to  further  evaluate 

ii tp ii  for  particular  possible  domain  values  of  iix^^i.  (relative  to 
ii  r ^  11  ,  of  course)  ,  we  must  consider  some  kind  of  constant  arrow  in 

c  which  in  some  sense  has  its  constant  value  in  Hill  e  Ob(C)  . 

To  abstract  the  above  need  in  a  general  setting,  we  will  assume 

the  following  for  deduction  category  (c,s)  : 

(w )  For  each  a  «  Ob(C)  ,  there  exists  (possibly,  many)  termi¬ 
nal  objects  1  such  that  the  arrow  (f.  :  1  -»  a)  e  Ar(c)  is  a 

a 

subobject  of  a  (i.e.,  a  monic  -  see  Appendix  1  at  the  end  of  2.4). 
Denote  the  class  of  all  such  arrows  (as  a  runs  over  Ob ( Z. )  )  as 
9"  (c )  .  in  turn,  consider  some  collection  of  interest  a  £  Wfexo(.')  . 

Then  define  a  mapping  (again  as  for  H  H  ,  in  general,  there  are  an 
infinite  variety  of  choices)  called  the  Individual  variable  assign¬ 
ment  map 


<*>  :  FVo(a)  -  ar  (c )  , 


such  that  for  any 


x(i)  e  FV  (A) 

O  '  7 


(x(1)  e  Wfv1(b) 


etc .  )  , 


w(x(1) )  = 


for  any  choice  of  f. 

•‘ii  in 

Finally,  the  full  semantic  evaluation  of  t 

sYno 

the  selected  collection  of  well-formed  e.-:..r  ssions 
denoted  as 


together  with 


in 


sYnr, 


is 


(«,*)  , 

=  £  is  understood) 

*  O 

II  II  :  L  -*  (C  ,»)  . 

u  '  ' 

it  should  be  noted  that  many  semantic  evaluations  Indeed  satis¬ 
fy  condition  (u)  .  Using  the  previous  example  of  v  =  r^[X^^]  , 

we  note  the  following  interpretations  for  deduction  category  (C,‘i) 
(see  Appendix  2  at  the  end  of  2.4): 

(i)  (C,»)  =  (SET,  dub)  . 


ii  ii  :  l 
w  syn 


or  simply  (where  l 
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Then,  noting  that  if  i M  e  Ob(SET)  and  here  lt|  ^  «  Hill  may  be 
arbitrary,  iirJi  £  Hill  ,  and  without  loss  of  generality  we  have 


in 


Ln  ill 


in 


,  a  one-point  map,  so  that 


imi  =  Sub(f,  )  (iir.H  ) 
w  1  ii  in  1 


=  f"1  (up  ii  ) 
ii  ii 


establishing 

1ii  ii 

<3 

when  a 

iff  1„ 

iff  1„ 

point  1 

H  e  II  r i  H 
e  nriii 

Hill  is 

t 

in  the 

relation  nr^ii 

or  not . 

(ii)  i 

(c,st)  = 

(SET,  H‘ 

)  .  h‘ 

being 

exponentation , 

where 

H  =  [0,1]  . 

Again 

,  Hill  e 

Ob  ( SET ) 

and 

1,.ii,  e  1111,1 

arbitrary , 

as  before.  But  here  ilr^n  e  HUi"  and 


imi  =  H  (f .  )  (nr.ii 

u  1  .  i 

Mill 


=  iir^ii  °  f 


in 


"  "rill(1i,il.)  * 

establishing,  in  a  general  sense,  the  degree  to  which  assignment 
is  compatible  (or  is  "in")  with  the  evaluation  iirJi 


,U) 


ii  in 
of  r 


of  x ' 
i  • 

Again,  to  reiterate,  from  now  on  throujh  the  remainder  of  the 


text,  the  semantic  evaluation  ii  ii 


sumed  to  be  a  model  of  Th 


syn 


syn 


-♦  (c,a)  is  always  as- 


( t )  .  When  the  structure  i  =  L 


sVn„ 


is  understood,  we  simply  write  ii  ii  .  L  -*  (C,a) 
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II .  The  canonical  deduction  category. 

In  part  (I)  of  this  section,  a  formal  language  LSyn  was 

evaluated  in  a  given  deduction  category,  and,  further,  the  concept 
of  models  within  that  deduction  category  was  treated.  But  note  the 
obvious  partial  converse:  A  deduction  category,  namely 

Var(t)  =  (Var  (£),»(£))  ,  is  always  present  within  £  (see 


2.2.2)  so  that  ll  II 


Var (£ ! 


:  L  -»  Var (£)  is  a  canonical  model  - 

syn 


and  Var (l)  can  be  considered  a  canonical  deduction  category  - 
defined  as  follows: 

(i)  For  any  a  e  Typ(£)  , 

Hall  =  ll  Var  (a)  il  =  Var  (a)  , 

Var (£)  Var (£) 

(ii)  ii  Wf  tero  (/. )  ll  =  Wftero(£)  ,  etc., 

(iii)  Ordering  is  the  same.  We  can  let  (r  £  s)  =  (r  s)  , 

for  all  r,s  €  De(Th  >(£))  '  b  =  °<K>  etc. 

syn^ , u 

Thus,  essentially  n  u  :  l  -»  ^ar(£)  is  an  identity-like 

Var (£ ) 


III.  Semantic  evaluations  within  a  formal  topos. 


In  part  (I)  of  this  section,  we  developed  a  class  of  semantic 
evaluation  procedures  of  L  with  values  within  a  given  deduc- 

synQ 

tion  category  (c,a)  .  Some  of  the  syntactic  axioms  and  rules  (the 

second  level  ones  -  R_T  , ,,  and  R__  Involved  certain 

1 1, syn, 10  1 1, syn, 11 

of  the  specially  designated  function  symbols 

(n  :  ObU)  -  0b(£)),  (e  :  0b(Z)  Rel(i)),  (&  :  RelU)n-.  Rel(i)), 
(v  :  Ob ( i )  x  Ob ( t )  x  Rel(t)  -♦  Rel(t)),  ...)  only  in  a  cursory  way. 
However,  it  is  often  important  to  give  these  additional  symbols 

meaning.  t  J  certainly  at  the  formal  level  presents  one 

^  y  *  **' 

approach  to  their  interpretation.  In  turn,  the  semantic  evaluation 
of  this  part  of  formal  language  must  also  be  considered.  That  is, 
we  need  additional  structure  on  (c,»)  in  order  to  evaluate  the 
important  aspects  of  set-theoretic  foundations,  logical  connectors 
and  quantifiers. 

The  additional  structure  for  (c,a)  we  consider  here  is  in  the 
form  of  a  formal  topos.  (See  Appendix  2,  end  of  section  2.4.) 


We  simply  extend 


(c  ,»)  to 


:  i  „  -*  (c,*)  (now  a  formal  topos)  over  the  set-theoretic 

syn^ , n 


n  ii 
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foundation  symbol  class,  Foun(£)  =  (ft,  «,  x,  {•!••},  ...}  ,  the 

class  of  logical  connectors,  Loc(£)  =  {nt,  &,  or,  =» ,  ...}  ,  and  the 
class  of  quantifiers,  Quan(£)  =  {v,  a,  ...}  .  In  addition,  we  wish 

this  extension  to  be  compatible  (analagous  to  that  for  £  with 

sYn0 

Th  (£))  with  Th  „{£)  ,  i.e.,  to  be  a  model  for  £  „  . 

syno  synK<  K  synK,K 

In  particular, 

HFoun(£)n  =  Foun(c,a)  , 

II  Loc  ( £  )  ll  =  Loc  ( C  ,a)  , 

MQuan(£)H  =  Quan(c,»)  , 


where  again,  see  Appendix  2  for  all  characterizing  properties. 

Note  also,  that  analagous  to  part  (II), 

ii  ii  :  £  v  -  Vrar(£)  can  be  considered  a  canonical  model 

Var  ( £  )  synK'* 

for  £  in  ^ar(£)  ,  having  now  a  formal  topos  structure  (A  la 

syrig ,  K 

ThK(£))  . 


In  summary, 
into  formal  topos 
II  ll  :  £ 


we  write  for 
(c,a)  as 

synR , K  '  ' 


a 


semantic 


evaluation  of 


'synK,K 


and  if  individual  variable  assignment  map  w  is  also  applied,  we 
write 


ll  ll  :  £  „  -*  (C  ,a)  . 

u  synK,K  ' 


Or,  again  for  simplicity,  when  the  structure  of  £  is  under- 

synK , k 

stood,  we  write  simply 
ii  ii  £  -♦  (c  ,a) 

and 


ii  uu  :  £  -  (c,a)  , 
respectively . 


IV.  Semantic  evaluations  within  a  category  or  category  with 
additional  structure,  or  topos. 

As  ( 1 )  a  consequence  of  the  universal  arrow  and  imbedding  pro¬ 
perties  of  deduction  categories  with  possible  additional  structure 
within  the  generic  (y (c ,») ,Sub)  as  given  in  Appendix  3,  section 
2.4.2,  (2)  because  of  the  characterizing  result  that  include,  e.g., 
(C,Sub)  is  a  formal  topos  iff  C  is  a  topos  (see  the  beginning 
part  of  Appendix  3),  and  (3)  compatible  with  the  basic  literature. 
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denote  the  semantic  evaluation  n  11  :  i  -*  (c,Sub)  as  simply  n  n  : 

I  -*  c  .  Similarly,  11  il  :  l  -*  c  is  a  model  means  11  li  :  L  -*  (c,Sub) 
is  a  model  (for  some  theory  Th(t))  .  Similar  remarks  hold  for 

II  II  :  l  -►  c  . 

u 


V.  Summary 

Why  consider  (deduction)  categories  and  (formal)  topoi  for  se¬ 
mantic  evaluations  of  a  formal  language? 

Firstly,  deduction  categories  are  the  appropriate  vehicles  for 
the  ranges  of  semantic  evaluations  of  a  formal  language  L  ,  since 
as  seen  in  the  constructions  in  this  section,  they  contain  all  -  and 
in  general,  require  no  more  than  -  the  structure  necessary  for  cer¬ 
tain  of  the  semantic  evaluations  to  be  models  of  Th  it)  .  If  L 

syno' 

has  additional  structure  in  the  form  of  ThK(i)  ,  an  intuitionistic 

logic-set  theory,  then  for  n  il  :  L  -*  (c,»)  to  be  a  model,  (c,») 
must  be  a  formal  topos .  In  so  many  words:  each  topos  has  an  inter¬ 
nal  intuitionistic  logic-set  theory,  as  displayed  in  Appendix  2, 
2.4.2.  In  addition,  the  (generic)  Benabou  and  related  constructions 
(y (C ,a) ,Sub) )  show  that  without  loss  of  generality,  standardized 
and  enriched  formal  topos  forms  may  always  be  chosen  for  semantic 
evaluations  of  L  when  L  includes  Th^t)  .  (See  Appendix  3, 

section  2.4.)  Moreover,  Th^(/. )  is  sound  and  complete  relative  to 

topos  evaluations.  Similarly,  fragments  of  XhK(t)  are  sound  and 

complete  relative  to  deduction  categories  with  appropriate  corres¬ 
ponding  additional  structures.  (See  2.3.3  (b)  and  Appendix  3, 
section  2.4.)  Thus  topoi  (formal  topoi)  and  deduction  categories 
with  certain  additional  structure  represent  generalized  set  theories 
in  the  above  sense.  But  in  addition,  when  Thj,(£ )  is  modified  and 

strengthened  to  Thzer^/'^  *  topoi  (equivalently,  formal  topoi  with 

contravariant  functors  being  Sub)  correspondingly  strengthened  with 
additional  properties,  including  "well-pointedness" ,  "partial  trans¬ 
itivity",  etc.,  are  ranges  of  legitimate  models  of  Thzer(4)  »  in  a 

minimal  sense.  (See  section  2.4.2  (G).) 

But  again,  a  note  of  caution:  Because  many  of  the  problems 
that  all  of  us  deal  with  are  data-driven  or  "real-world"  ones,  it 
may  not  be  appropriate  to  assume  when  modeling  these  problems  at  the 
formal  language  level  that  the  restrictive  form  of  Thv(i)  is 

correct.  This  leads  to  the  relaxation  of  topos  structures  -  but, 
not  In  general,  category  structures,  for  those  sufficiently  flexi¬ 
ble!  This  is  considered  throughout  sections  2.3.3  -  2.3.9.  (See 
especially,  section  2.3.5.) 


2.3.3  Logics  for  a  formal  language. 

In  sections  2.2,  2.3.1,  2.3.2,  we  developed  first  a  basic 

formal  language  ig  together  with  semantic  evaluations 

*  O 

11  i  lSyn  -*  (c-a)  .  for  a  deduction  category  (c,m)  .  Then,  we 

expanded  L  to  include  a  more  encompassing  theory  ^^syn  ’ 

K 
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yielding  l 


synK,K 


II  II  :  l  v 

<*>  synR,K 


and  considered  semantic  evaluations 
(c,»)  ,  where  (C,»)  was  assumed  to  have  addition¬ 


al  properties,  in  particular,  that  of  a  formal  topos .  Indeed,  this 
structure,  when  appropriate,  leads  to  a  most  important  property  for 
any  model:  soundness  and  completeness.  (This  will  be  clarified 


later  in  this  section.)  Th 


synK,K 


as  mentioned  previously,  may  not 


be  the  appropriate  theory  for  a  given  situation  and  hence  should  be 
replaced  by  some  Th_„_  „,(x)  •  Furthermore,  (c,»)  need  not  have 

syiij,  t 

a  formal  topos  structure.  Indeed,  it  will  often  follow  that  in 

order  for  models  n  n  :  A  V1  -♦  (c,a)  to  exist,  (C,a)  may  have 

syn^  k. 

to  have  a  very  different  structure  from  a  topos.  (See,  e.g.,  sec¬ 
tion  2.3.5  where  general  logical  systems  are  considered  as  well  as 
this  subsection. ) 

Although  in  this  section  we  first  present  some  basic  properties 
and  results  for  formal  languages  and  semantic  evaluations  in  general 
-  depending  of  course  on  the  choices  of  Foun (£)  ,  Loc(X),  Quan(x)  , 
and  the  corresponding  theories  -  the  emphasis  will  be  upon  logical 
theories  and  corresponding  semantic  evaluations,  i.e.,  logics.  That 
is,  roughly  speaking,  a  logic  is  a  collection  of  semantic  evalua¬ 
tions  which  have  the  same  evaluations  (or  "trace"  cf  evaluations) 
upon  Loc(x)  u  Quan(£)  .  Similarly,  a  set  theory  "system"  (there 
being  no  term  in  vogue  for  the  analogue  of  "logic")  is  a  collection 
of  semantic  evaluations  which  have  the  same  evaluations  upon 

Foun(X)  as  well  as  usually  upon  an  associated  set  theory  and  com¬ 
patible  logic.  (See  2.3.4  for  some  set  theory  considerations.) 


(a)  Basic  definitions 


Let  t  be  a  fixed  formal  language.  We  denote  generically 
(c,a)  as  any  deduction  category  and  n  n  :  l  -*  (c ,»)  as  any 

semantic  evaluation  map.  Using  the  notation  of  2.3.2  -  but  omitting 
all  unnecessary  subscripts  -  we  present  a  list  of  relevant  basic 
definitions  involving  logical  considerations: 

For  any  i  e  Var(Ob(x))  ,  define 


Wff(i>U)  S  (y  I  r  €  Wff(x)  and  o  (y )  =  i)  . 

(i)  For  any  ii  n  and  any  z1>r2  s  Wff^j(X)  ,  we  define 

X,  *  X  iff 
ii  ii 


"V  =  <"V  1  ri  €  zi}  sa(i)  "Z2n  y  (lly2"  1  r2  6  z2} 


a(i) 

i.e.,  for  all  T ,  e  X,  and  all  V .  «  X 


2  ' 


ii  y 


l"  ^a(Ob(C  )  )ll¥'211  '  whlch  is  read  "zi  logically  (or 

is  a 


materially)  Implies  X 2  ,  relative  to  II  II  "  or  "  X 
logical  consequence  of  for  II  II  ",  or  "the  sequent 

(Z1  ^  Z  2 }  "  {  {T1  h  ^2 5  ,1P1  €  Zl'  *2  6  Z2) 


is  valid  for 


ii 


ii 
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(ii)  For  X  £  Wff  (i)(i)  and  given  »  il  ,  we  say  "  X  is 
s  at  tsf  table  by  n  il  "  iff 

II XII  =  il  T{  i  )  ii  =  II TII  (Hill  )  . 

Thus , 

( T  ..‘-.i  ' 

and  equivalently 

(xa  |(i-|(  x)  ,  for  all  z1  £  Wff(x)  . 

We  define  compatibly 

(X)  iff  (T  11^((  X)  ,  i.e.,  X  is  satisfiable  by  h  ll  . 

Note  that  many  authors  (see,  e.g.,  Maydole  [172])  define  va¬ 
lidity  in  a  weaker  sense  than  here:  (*^  l-  ¥ 2)  is  valid  for  n  il 

in  the  weak  sense  iff  (¥  ^  satisfiable  for  ll  ll  implies  Y- 

satisfiable  for  ll  ll  )  .  Whenever  we  discuss  validity  and  related 
definitions  2nd  results  of  such  authors,  the  term  "weak"  will  be 
appended.  On  the  other  hand,  Zadeh's  "fuzzy  logic"  (e.g.,  [280])  is 
a  semantically  based  theory,  where  deduction  and  validity  coincide 
with  the  concept  discussed  here.  (See  also  section  2.5.) 

(iii)  For  X  £  Wff^.(Jt)  and  Jf  ,  a  class  of  ll  ll 's  ,  we  say 

X  is  a  tautology  with  respect  to  Jf  iff  for  all  ii  il  e  jf  ,  x 

is  satisfiable  by  ll  H  .  We  say  Y  e  Wff^j(i)  is  contra¬ 
dictory  w.r.t.  Jf  iff  ll TPii  =  Hl(i)il  for  all  ii  ll  €  Jf  . 

(iv)  Consider  now  x^  the  class  of  all  il  ll  * s  .  Then  Jf1 

decomposes  into  a  disjoint  exhaustive  partitioning  (union)  of 
equivalence  classes  L  ,  such  that  for  any  such  L  and  any 

ii  ll  ^  ^  ,  ll  ll  ^  «  L  ,  il  ll  ^  ^  and  ll  ll  ^  ^  coincide  over 

Loc(t)  and  Quan(i)  ,  the  latter,  for  also  all  individual 
variable  assignments  w  .  Call  each  such  class  of  ll  n's  a 

logic  for  L  .  Specific  examples  of  logics  are  given  in  part 

(C).  We  will  alternatively  use  the  notation  Logicn  |((£)  to 

indicate  a  particular  logic. 

(v)  For  any  Jf  ,  a  class  of  ll  H  '  s  ,  and  any 
Xj.Xj  £  Wff^j(x)  ,  we  write: 

(a)  X:  w  X2  iff  (X:  I-  X  g )  is  Jf -valid, 

i.e.,  (Z  x  h  z  g )  is  valid  for  all  H  n  e  Jf  ,  or  equivalently 
X j  logically  implies  Z^  relative  to  Jf  . 

( P )  Zj  ***  z2  iff  (z:  x2)  and  (X2  i-x  X1  )  ,  which  is 

read  "  X^  and  Z^  are  equivalent  (or  tautological ly 
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equivalent)".  Thus,  if  z  £  Wff (l)  is  a  collection  of 
tautologies  with  respect  to  x  ,  then 

w  (X)  -  (T  w  X)  is  the  same  as  (T  H  Z)  . 
XX  x 

(■» )  We  also  denote 

Val  (£.),»  it )  =  { (P  l-  0  )  |  (P  h  0 )  is  a  sequent  valid  for 

ii  ii  } 

and 

Val  (x  ,x)  =  n  Val  (i  ,  ii  H  )  , 
il  ilex 


as  the  class  of  all  sequents  valid  simultaneously  for  all 
II  ii  e  x  . 

Note  the  specialization  of  (a)  ,  (p)  ,  (t )  when  x  =  L  ;  in 

the  case  of  (t )  ,  we  use  the  term  "logically  equivalent  with 

respect  to  L  "  ,  etc. 

(vi)  We  say  a  logic  L  is  truth  functional  iff  for  eadi 
nil,  each  t  e  Locn(i)  ,  there  is  a  function  f^  such  that 

for  all  individual  variable  assignment  functions  u  ,  all 
ii  ii  e  L  ,  all  ?1#  ....  *n  «  Wff(1)(X)  , 


in  (P 


1‘ 


•V"«  = 


.IIP  II  ) 
'  n-  w ' 


The  large  class  of  truth  functional  logics  L  generated  by 
negations,  t-norms  and  t-conorms  is  treated  in  section  2.3.6.  On 
the  other  hand,  in  Example  4  of  part  (C)  of  this  section,  the 
non-truth  functional  logic  PL  (Probability  Logic)  is  briefly 
considered.  (See  also  2.3.9  (A)  for  more  details.) 

In  particular,  note  the  special  case  where  for  all  H  ll  e  L  , 


iiWff(/.)ii  =  (UPll  |  P  €  Wff(£))  =  H  , 
y  y 

a  fixed  lattice  of  some  kind  (such  as  (0,1)  for  classical  logic 
and  [0,1]  for  many  multi-valued  logics),  and  where  for  each 

i  «  Loc  (t)  ,  f  :  H0  -•  H  ;  we  call  H  the  truth  space  for  L  . 
n  •» 

We  call  the  set  of  all  possible  f  :  Hn  -♦  H  ,  Sool^fH)  ,  the  class 

of  H-valued  Boolean  functions.  We  call  the  subset  of  Bool(H)  used 
in  L,  the  designated  semantic  truth  functions  for  L  .  In  part  (C) 
of  this  section,  many  of  the  logics  are  described  through  the  evalu¬ 
ations  of  their  logical  connections  being  Boolean  functions. 

(vii)  Truth  functional  completeness.  (See  also  Rescher  [214], 
62-66,  Maydole  [172],  140-141). 

A  truth  functional  logic  is  truth  functionally  complete 
iff  all  possible  Boolean  truth  functions  (relative  to  the 
appropriate  truth  space  H  )  can  be  defined  as  functional  com- 
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binations  of  the  designated  semantic  truth  functions  (Boolean) 
for  the  logic.  For  example,  classical  logic  (Lukasiewicz  logic 
with  n  =  2  )  is  complete  with  respect  to  NOT  ,  AND  ,  OH  . 
In  multi-valued  logics,  in  general,  this  is  not  true.  However, 
Postean  n-valued  logic  is  truth  functionally  complete,  where 
indeed  a  single  binary  connective  has  been  shown  to  generate 
all  Boolean  connectives  (Rescher  [214],  p.  65).  In  particular, 
all  Lukasiewicz  n-logics,  for  Ro  >  n  k  2  ,  are  truth  func¬ 
tionally  complete.  Finally,  no  infinite-valued  logic  which  has 
a  finite  set  of  designated  logical  connectives  can  be  truth 
functionally  complete. 

(viii)  Briefly,  we  again  define  the  following: 

ThU)  =  (AxU),  Rul(A)) 

is  a  theory  for  L  ,  when  Ax(£)  ,  the  axioms,  is  a  collection 
of  relevant  sequents  from  Seq (l)  ,  the  class  of  all  possible 
sequents  of  wff's  usually  involving  the  order  symbol  t-  . 

Rul(£)  is  a  class  of  deduction  rules,  i.e.,  relations  among 
the  axioms.  In  turn,  the  class  of  all  theorems  or  deducts  of 
Th(i)  is 

De(Th(r))  =  span(Th(£)) 

(See  also  section  2.2.2.) 

Note  that  axioms  and  deductive  rules  for  theories  may  be  pre¬ 
sented  in  several  different  (and  in  general,  non-equivalent)  ways. 
(See  section  2.2.2,  for  example,  for  the  presentation  of  the  the¬ 
ories  Th  r„  U)  and  Th„U)  ,  in  terms  of  sequents,  as  we  have 
syno  x 

agreed  tacitly  to  do  throughout  the  text.)  Within  this  structure, 
one  may  choose  a  number  of  ways  that  a  given  idea  may  be  represented 
as  an  axiom  or  rule  sequent.  Consider  for  example  the  basic  (Z) 
set  theory  Axiom  of  Extension  (Zl)  (see  section  2.3.4.b),  using 
simplified  notation  for  all  individual  variables.  (Zl)  as  formu¬ 
lated  as  a  single  wff  ?  : 

(Zl)  r  =  (VX)  (Vy)  (VZ)  ( ( (x  €  y)  C=»  (X  €  Z)  )  =»  (y  *  z)  )  . 


Approach  1 :  ( T  t-  r  )  . 

Equivalently:  (e  b  r)  ,  for  all  e  e  Wff(i) 


Approach  2:  (  ((Vx)  (Vy)  (Vz)  (  (x  €  y)  c=»  (x  e  z)))  h  (y  *  z))  , 

etc . 

We  define  analogous  to  that  of  tautologies: 
l-  ( V” )  3  (Tb?) 

We  define  Th(Z)  to  be  consistent  iff 
De(Th(£))  c  seq(i)  (proper) 
and  to  be  inconsistent  Iff 


De(ThU)  )  =  Seq (i) 
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(lx)  We  also  recall  that u  t ;  L  -  (C,a)  is  called  a  model 
for  some  theory  Th [t)  iff  Ax(£)  £  Val(£,il  11  )  ,  which  ini 

general  (assuming  modus  ponens,  etc.  holding  -  see,  e.g.. 

Appendix  2,  end  of  section  2.4,  rule  Rj  xh  2^  '  in  which  cas« 

De(Th(Z))  £  Val(Jt),ll  II  )  ,  the  (deductively)  soundness  property 

of  Th (L)  for  il  H  . 

In  addition,  we  define  for  a  given  logic  L  and  theory 

Th(£)  ,  the  class  of  all  models  of  Th(£)  relative  to  L  as 

Mod ( Th (L)  , L )  =  (il  il  |  li  il  €  L  &  il  ll  is  a  model  for  Th (L))  . 

If  Mod(Th(£),L)  *  0  ,  we  say  Th  (£)  (or  A x(z))  is  model  - 

const  stent  . 

More  generally,  for  any 

Z  £  Seq(/.)  =  Wff(t)  x  (h)  x  tJff(l)  ,  we  say  Z  is  model- 
consistent  iff  there  exists  a  n  il  such  that  z  £  Val(t,  ii  n  )  . 
For  any  collection  X  of  ll  H  1  s  ,  we  say  z  is  model- 

consistent  w.r.t.  (X  iff  Z  £  Val  (£,x)  .  On  the  other  hand, 
we  say  Z  is  modei-Wcons  1  stent  for  (or  in)  x  it  is  not  model- 
consistent  w.r.t.  x  .  (in  particular,  we  can  consider  the 

case  Z  =  De(Th  (£.)).) 

We  also  abuse  notation  somewhat  and  define  the  class  of 
all  models  n  ll  :  t  ■*  (c,»)  of  Th(£)  ,  for  (C,»)  fixed  as 
Mod(ThU),  (C.*))  . 

In  turn,  we  say  Th(£)  is  a  (deductively)  sound  theory 
for  logic  L  Iff 

De(Th(i )  )  £  ValU.L)  . 

On  the  other  hand,  we  say  Th  (X.)  is  a  (deductively) 
complete  theory  for  L  iff 

Val  (/■ ,  L )  £  De(Th(i)  )  . 

Similarly,  we  also  use  the  notation,  Val  (£  ,  (c  ,*»)  )  and 

consider  soundness  and  completeness  relative  to  all  ll  ii's 

having  ranges  in  a  fixed  (C,»)  . 

(x)  Note  that  if  Mod(Th(4),L)  has  non-degenerate  members 

ii  ii, .  ,  then  if  Th{£)  Is  sound  and  model-consistent  relative 

L  ,  then  Th(i)  is  also  consistent.  (Proof:  If  Th(L)  is 
sound,  model-consistent,  yet  inconsistent,  then  there  exists 
ii  ll u  (non-degenerate)  such  that  Seq [z)  =  De(Th(£))  £  Val(i,L) 

£  Val(£,  ll  i!)  £  Seq(i)  ,  whence  equality  holds  throughout  and 
Seq  ( L )  =  Val  (i ,  ll  il  )  .  Hence  for  all  ?,9  e  Wff(i)  ,  n?ii 

w  Jj 

il  an  and  ll?  H  ll  9  il  ,  i.e.,  UAH  =  tl?il  ,  for  an  appropriate 

choice  of  . ) 


(xi)  Let  theory  Th(£)  and  logic  L  for  L  be  given. 

Th(£)  is  logically  complete  w.r.t.  L  iff  for  all 
r  «  Wff(t)  ,  If  wL?  ,  then  (►-?)  «  De(Th (/.))  . 


ThU) 
?  «  Wff (t) 


is  logically  sound  w.r.t  L  iff  for  all 
,  if  (i-?)  e  De(ThU)  )  ,  then  . 


Then : 
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Note  that  if  Th(i)  is  (deductively)  complete  w.r.t.  L  then 
it  is  logically  complete  w.r.t.  L  .  Th(£)  axioaatizes  L  iff 

Th (£)  is  (deductively)  sound  and  complete  w.r.t.  L  and  for  any 
X  £  Val  (X  , L)  ,  y  €  Wff(X)  ,  if  X  H  y  ,  then  (l-y)  €  De(Th(x)) 

(the  "capturing"  property  for  Th(x)  .w.r.t.  L  )  . 

(xii)  Finally,  note  some  miscellaneous  definitions  will  be 
mentioned: 

Th(x)  has  the  compactness  property  for  logic  L  ,  iff 

for  each  finite  set  a.  £  De(Th(x))  ,  there  is  a  li  11  e  L 

such  that  a  £  Val(x,  H  li^)  ,  implies  that  there  is  a  n  n  €  L 

such  that  De(Th(X))  £  Val(x,  ll  ll  )  .  Obviously,  if  Th(x)  is 
sound  for  L  then  it  is  compact  for  L  . 

Let  L  be  a  countable  language,  i.e.,  L  has  at  most  a 
countable  number  of  symbols  and  hence  terms,  etc.  Then  the 
Lowenhe t m-Sko l em  property  holds  for  Th(x)  iff,  when  there 
exists  ii  H  such  that  De(Th(X))  £  Val(x,  ll  ll  )  ,  then 
H  ii  :  x  -»  (c,a)  may  be  chosen  so  that  its  semantic  domain 
( ll Var ( Ob(x  )  ) ll  £  Ob(c))  is  at  most  countable. 

For  the  basic  concepts  of  negat  I  on-cons  I  stency  ((8  l-y)  and 
(e  h-not  y)  not  occurring  for  Th(X))  and  the  related  Law  or  Excluded 
Middle,  see  Rescher  [214],  pp.  161-166,  148-154.  For  other 

definitions,  including  decidability  and  recursion,  see,  e.g., 
Enderton  [54],  Curry  [44],  and  Rescher  [214]. 


(b)  Soundness  and  completeness. 

In  this  subsection,  we  proceed  to  establish  the  soundness  and 
completeness  for  certain  theories  relative  to  semantic  evaluations 
in  (deduction)  categories,  and  in  particular  in  (formal)  topoi .  The 
materials  presented  below  are  drawn  from  Coste  [41],  using  however, 
our  notations.  Roughly  speaking,  soundness  and  completeness  means 
that  the  deduction  system,  that  is,  the  choice  of  theory  Th (x)  , 

and  class  of  semantic  evaluations,  is  "correct". 

First,  (F,a)  is  called  an  arrow  between  two  deduction 
categories  (C,a)  and  (*,*)  iff  F  :  C  -♦  a  is  a  functor  and 
a  a  -»  yoF  is  a  natural  transform.  When  these  deduction  cate¬ 
gories  are  formal  topoi,  we  continue  to  call  (F,a)  an  arrow  if,  in 
addition,  a  preserves  order  and  all  the  properties  Foun(C)  , 
Loc(C)  ,  Quan(c)  ,  used  to  define  a  topos .  (See  Appendix  2,  end  of 

section  2.4.)  It  follows  that  DeCat  =  (Ob(DeCat),  Ar(DeCat))  is  a 
category  where  (Ob(DeCat)  =  class  of  all  deduction  categories,  and 
Ar (DeCat)  =  class  of  all  arrows  as  defined  above. 


Lemma  1  (See,  e.g.,  Coste  [41],  p.  15.) 


b  = 
tain 


Let  L 

0  or  K 
fraction 


be  a  formal  language  of  the  form  LSyn  b  for 


( or 


b  £ 

lK< 


K  symbolically  indicating  that  only 


of  Th„.(x)  is  assumed)  as  developed  in  section 


either 

a  cer- 

2.2.2. 


Also,  recall  the  canonical  deduction  category  ?ar(i ) 
topos  for  b  =  K  )  and  the  canonical  model  ll  ll 

Var (t ) 


(a  formal 
L  -  \far  ( L  ) 
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(see  2.3.2  (II)  ,  e.g. )  with 

ValU,  ii  it  )  =  De(Th  (i))  . 

Var(t)  synb.D 

Hence,  Th  .(£)  is  model-consistent, 
syn^ , d 

Then,  in  addition,  it  follows  that  for  each  deduction  category 
(c,a)  (formal  topos,  for  b  =  K  ,  and  more  generally,  a  deduction 
category  with  some  additional  structure  corresponding  to  symbolical¬ 
ly  b  s  K  -  see  Appendix  3,  section  2.4  for  details  on  these 
intermediate  forms)  there  is  a  bijection  y,  .  , 

r,  .  :  Mod(Th  (i),  (c,a))  -  Ar  (DeCat)  , 

'  '  '  synb,b  ( Var (£ )  ,  (c ,») ) 

the  latter  collection  of  arrows  being  order  preserving,  where  for 
any  II  n  e  Mod(Th  (C,a))  , 


^(C.a)^1  11  >  '  ("  11  <  ru  I,  )  -  where  r|(  |(  is  given  in  2.3.2. 


Theorem  1  (See  Coste  [41],  p.  15) 


is 

or 


Let  i  and  (C,a)  be  as  in  the  last  lemma. 


Then 


U) 


(deductively)  sound  and  complete  in  (e,»)  (b  =  0  or  b  =  K  , 
b  s  K)  ,  hence  (by  result  f.c)  of  part  (a))  consistent. 


(Proof :  Note  that  for  any  li  ii  e  Mod(Th  .  (£),  (c,a))  »  the  fol- 

syn^ / d 

lowing  diagram  commutes  (i.e.,  the  obvious  arrow  compositions  are 
equal )  : 


il  II 


Var (i ) 


-+  9ar(i ) 


ii  ii 


(c  ,») 


(c ,») 


Hence , 

Val (L , (C ,*) 


=  Var(i,  ii  ii  ) 

Var(i) 

=  De(Th  ,  (£  )  )  .  ) 

synb , b 


Remark. 

Any  tautology  r  WffU))  for  any  H  il(c  ,  as  above,  can 

be  written  in  the  formal  language  as  ( T^  r ,  and  hence  by  the 
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rules  (T\  =t  y^)  ,  yielding  for  any  model,  involving  T  , 

11  Ti"  ( ii  ill  ) "^i11  '  and  maximali'tY  of  HT^ll  (preserving  )  , 

liTjil  =a(n  in  )l,yin  ■  Thus,  e.g.  (by  its  very  definition)  ,  itself 

is  always  a  tautology  (for  any  model)  as  will  be  (see  2.2.2,  VI, 
Theorem  7)  ^  =  nt(y)  v  nt(nt(y))  ,  for  any  r  «  Wff(4)  ;  hence  we 

can  write  in  L  ,  (T^  0^)  •  Moreover,  by  the  soundness  and  com¬ 

pleteness  property  of,  e.g.,  Tli  _  „(A)  >  it  follows  that  all 

6yiij,  t  *> 

tautologies  <$i^  e  Wff(t)  may  be  identified  as  (T^  i-j,  . 

Next  we  obtain  soundness  and  completeness  of  Th  .  it) 

synb>b' 

relative  to  ordinary  categories  (corresponding  to  b  =  0)  ,  ordina¬ 
ry  topoi  ( corresponding  to  b  =  K)  ,  and  intermediately,  categories 
with  additional  structure  (corresponding  to  b  £  K)  .  (See  2.3.2  IV 
for  the  definition  of  semantic  evaluations  in  ordinary  categories  or 
topoi . ) 

Theorem  2 

Let  £  n  5  (b  =  0  or  K  or  b  £  K)  be  given  with  c  as 
b ' 

in  Lemma  1 . 

Then  .  Th _  ,_(£)  is  sound  and  complete  in  c  ,  i.e.,  in 


synfa ,  b 


(C , Sub )  . 


Proof : 


The  composite  map 


Var(£)  Var ( £ ) 


)  o  II  II 


9ar (£  ) 


L  -  (y__(7ar(i)  )  ,Sub) 


is  a  model  so  that  because  {G^_  ,p~  )  is  order  preserving 

Var(£)  Var{£) 

so  is  the  entire  map.  In  addition,  analogous  to  Lemma  1,  there  is  a 
bijection  for  each  ordinary  category  c  (or  topos)  expressed  as 
deduction  category  (c,Sub) 

V’(C,*)  :  Mod(Thsynb,bU)  '  (C,Sub))  -  Ar(yii(9ar(f.)  )  ,Sub)  ,  (C  ,Sub) 

(DeCat)  . 

The  following  diagram  commutes: 


Var (i ) 


(C , sub) 


(c , Sub) 


Var ( £  ) 


Var (£)  Var (£) 


■  (»n(?ar(c)  )  ,  Sub) 


7  (C  ,  Sub )  ^ 11  11  (C  ,  Sub )  ^ 
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Thus,  Theorem  1  applies  completely  here. 


Remark. 


The  ideas  developed  through  this  subsection  may  be  extended  (or 
attempted  to  be  extended)  as  follows,  when  category  or  topos  struc¬ 
ture  is  lacking: 

Establish,  for  each  structure  a  to  be  considered,  a  bijection 
¥  :  Mod(Th(£),a)  -»  Map($ar  (£  )  ,a> ) 

where  "Map"  in  some  natural  sense  replaces  arrows  and  is  an  order 
preserving  class.  Then  all  we  need  consider  (or  we  can  define  to  be 
considered)  is  the  identity-like  model  n  ,  for  relating 

Var {t ) 

properties  of  L  with  properties  of  a  . 


(c)  Examples  of  logics 

In  the  following  examples,  we  consider  only  semantic  evalua¬ 
tions  of  the  form  11  n  :  l  -*  ( SET ,  H*  )  .  Moreover,  of  the  seven  types 
of  logics  briefly  considered,  only  one  -  Probability  Logic  -  is  a 
non-truth  functional  one.  In  any  case,  it  follows  that  all  of  these 
logics  involve  relations  and  functions  upon  H  in  the  determination 
of  ii  ii  restricted  to  Loc(£)  and  Quan(£)  .  See  Rescher  [214] 
for  a  more  comprehensive  treatment  of  such  logics. 

1.  Logic(|  |,(*)  =  Lukasiewicz  Logicn  ,  n  *  2,3,4, ...  .Rq.Kj  . 

L  -  Lukasiezicz  Logic 
n  n 

i_2  =  Classical  two-valued  logic  . 

Define : 


[0,1]  (unit  interval) 

I  =  set  of  all  rationals  in 
n 

.  {0 , 1/ ( n-1 ) , ....  1)  ;  n  = 

where  H  =  I  =  truth  space  of 
n 

Now  consider: 


n  =  Nq 
[0,1]  ;  n 

2,3,...  , 


where  Loc  ( £  2 )  =  (nt,  =>  ,  3) 

with  compound/eliminative  definitions: 


(yl 

CM 

> 

for 

(¥1  =>  ¥ 

(yl 

A  ¥  2 ) 

for 

nt ( nt¥ 

(¥1  «=»  ¥2) 

for 

(¥1  z»  ¥ 

(V) (¥1) 

for 

nt ( 3 ( nt 

2)a(y>2  =>  ) 


for  any  ¥ , 


¥2  e  Wff(t2)  ^  Wff  U2) 


slightly  abusing  notation. 
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We  do  not  consider  because  of  closure  problems  in  truth 

0 

evaluations . 

Truth  space  (t2)  =  range  cf  semantic  truth  functions  of 

l2  =  {0,1}. 


Semantic  Truth  Func(i_2) 

=  {Until  ,  ii&ii  ,  Horn  ,  ii=»ii  ,  n «=» u  ,  ii3ii  ,  uvti } 
where  for  all  a  ,y  e  Wff(L2)  , 

11  nt { 8  )  11  =  until  (lieu  )  (lien  <=  {0,1}) 

=  1  -  lien 

11  e  a  ? n  =  min{  n  e  n  ,  11  mi  ) 
n  e  v  yn  =  maxii  a  11  ,  11  mi  ) 

11  e  :»  yn  =  n  ( nte  )  or  yn 

=  max(  1 — 11  e  ii  ,  nyn  ) 

ne  «=»  yu  =  min(max(  l-neii  ,  nyn)  ,  max(l-nyn,  nan)  , 

pointwise . 

n(3a)yilu  =  sup  {inPiiu,  |  »•  e  G (a,u)(y)}  , 

"<Va)rilu  =  inf  {iiynu,  |  »•  €  G (a. **){*)}  , 

where 


G(a,u)  _  (u>  |  »•  individual  variable  assignment  such  that 

u • (y)  =  »(Y) •  Y  €  TV(y )  a  (a)  } 

.  arbitrary  ;  y  =  c; 

Note  that  validity  and  satisfaction  in  classical  logic  are 
directly  connected  because  of  the  simple  truth  sp*<-»  I  =  {0,1}  : 

For  rlfZ2  S  Wff(L2)  ,  1.  ^  z2  iff  whenever  z 

satisfiable  by  any  n  11  e  1. 


Z-  1"L  z2 
c2  * 

hen  so  is  z2  ,  etc. 


Basic  Theorems  for  > 

2 

For  all  <p.r  €  Wf  f  ( L  2  ) 


(see  also  t.  ) 


( i ) 

n  <pn  s  nyn 

,  iff 

<t>  w 

T  11  11 

y 

iff 

11  (<j>  =» 

y }  11 

and 

n  <f>ii  =  nyn 

iff 

<P 

y 

11  n 

f  f 

11 1  «=» 

yn 

for  all  11  11  €  t 
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(ii)  y  is  contradictory  iff  t  is  model-inconsistent. 

(iii)  ?  is  a  tautology  in  i_2  iff  nt(if)  is  contradictory  in 


(iv)  When  we  add  various  basic  theories  Th(L2)  to  i_2 


we  can 


obtain  (deductive)  soundness,  (deductive)  completeness,  logical 
completeness,  logical  soundness,  consistency,  and  that  ThU_2) 


axiomizes  (_2 
Maydole's 

4  QIi, 


In  particular,  (see  Maydole  [172],  pp.  50-54) 
is  such  a  theory,  with  five  axioms  (all  in  formal 


tautology  form  (Hr))  and  two  deduction  rules  (modus  ponens  and 
universal  quantification).  (See  also  2.3.4(b)  for  a  listing  of 
these  axioms  and  rules  A1  -  A5 ,  Rl,  R2 . ) 

Note  that  for  t_2  ,  all  weak  and  non-weak  properties  coincide, 

including  validity  and  satisfaction,  due  to  the  simple  truth  space 

(0,1). 


An  excellent  treatment  of 


i_2  may  be  found  in  Curry  [44]. 


Enderton  [54]  and  Lightstone  [156]  also  present  comprehensive  stu¬ 
dies  of  classical  logic,  together  with  set  theory  models,  Including 
-  via  the  compactness  property,  in  effect  -  Robinson's  nonstandard 
numbers . 

For  r  =  2,3,4,...,  (finite),  is  the  same  as  L2  except 

for  truth  space  (t_2)  =  {0,1}  is  replaced  by  truth  space  of 

Ln  "  Xn  • 

Basic  Theorems  on  l_n  ,  n  =  2,3,  . .  .  ,VtQ,  . 

(i)  For  all  n  :  for  all  9  ,r  e  Wff(i_n)  ,  (abusing  notation 
slightly) 

If  (e  =>  v)  ,  then  9  implies  ?  relative  to  Ln  . 

n 

Note  that  for  all  9  ,  ip  , 

(9  a  if) 
n 


max  (  1  —II 9  ii  ,  ihfii  )  =  1 


»  =  l  or  u 9 »  =  0  . 

(ii)  For  all  n  :  for  all  if  €  Wff(t  )  ,  if  if  is  contra¬ 

dictory  in  L  ,  then  if  is  model-inconsistent  in  t 


(iii)  For  all  n  :  for  all  T  e  Wffft^)  ,  if  (t  )  then 

K  (?)  ,  which  in  turn  implies  ?  . 

cn  ~n 
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(iv)  Weak  compactness  theorem  holds  for  all  L  •  i  e 

n 

v  2  s  Wff(>^)  ,  2  is  satisfiable  (for  some  model)  iff  v  2'  s  2 
r'  finite,  2'  is  satisfiable  (for  some  model). 

(v)  The  weak  Lowenhem-Skolem  property  holds  for  all  b 

i.e.,  for  any  2  s  WffU^)  ,  if  2  is  satisfiable  then  there  is  a 

countable  semantical  domain  and  a  corresponding  model  which  also 
makes  2  satisfiable  (analogue  of  Lindelof  covering  property). 

is  not  axiomiat  izable  and  hence  any  Th(L  ) 
b  ^1 

cannot  be  both  logically  sound  and  logically  complete  for  (weak 

sense,  here).  Hence,  if  Th( L  )  is  logically  sound  for  ^  it 

1  H1 
cannot  be  (deductively)  complete  for  L 

*1 

(vii)  For  any  €  Wff((_n)  ,  define 

2  V)  =  $  (<f)»  (;»  •••($:»?)...))  , 

^ - —  •  ■  ,  -  ^ 

n  terms 

Then 


ii<{)  =»  ipii  =  min(  1  ,n(  l-n«n  ;  +  nyii  ) 
for  all  ii  ii  e  i  . 


(See  Maydole  [172],  pp.  105-111,  for  further  facts.) 

See  the  paper  of  DeGlas  [45'],  where  the  Stone  representation 
theorem  is  extended  to  provide  a  semantic  basis  for  Lukasiewicz  and 

theory  giCS’  WhlCh  in  turn  can  be  used  as  a  basis  for  fuzzy  set 


2.  Logic (Z)  *  Bochvarian  Logic.  B  =  n  =  2  3  A 

•  •  •  • 

Syn(i)  -  same  essentially  as  for  i2_  Loc(2)  =  (nt,  3,  or,  &, 
with  V  a  compound  or  eliminative  definition. 


lint  (  0  )  ii  =  l  -  ii  e  ii 

fmin(l,  neii  +  nyi 


118  3  P  II 


z  a 

n 


n  -  l 


,  if 


l 

2 

n 

-  2 

■  1  *  n 

-  1*  •  * 

’  *  n 

-  1* 

1 )  . 

II  c=»  II  , 

ll&n  , 

ii  om 

i  ,  II  3  II 

,  II VII  ) 

f  II 8 

ii  ,  il  p  ii 

€  ( 

0,1) 

II  VII 

otherw 

ise 

and 

n  odd , 

,  II  P  II 

othe 

rwis 

e  and 

n  evi 

r  II  0  II  ■  M  VII  iff  lien,  II  TP  ||  €  (0,1) 

1  Zn  if  otherwise 


H0  &  ¥MI 
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For  B  ,  everything  is  same  as  above,  except  Truth  Space 
1 

(B^  )  =  [0,1]  ,etc.  (See  also  Maydole  [172],  p.  125.) 


3.  Log|(  „ (2)  =  Gddel-Intuitionistic  logic  Gn,n=2,3,4,...,  . 

Sem  truth  func  { Gn )  =  {until,  ii=»ii  ,  h&ii  ,  Horn,  ii«=>n  }  ;  and  can 

be  considered,  also. 

Note  that  here,  the  truth  space  is  a  Heyting  algebra  H  .  More 
specif ically ,  H  is  a  fixed  complete  Heyting  algebra,  i.e., 

(a)  H  is  a  lattice  -  a  partially  ordered  set  with  order  <H  , 

say,  over  H  x  H  ,  which  is  reflexive,  autisymmetric  and  transitive, 

such  that  operations  a  ,  v  :  H  x  H  -•  H  exist,  where  for  any 

x,y  e  H  ,  x  a  y  is  the  meet  (minimum  or  greatest  lower  bound)  and 

x  v  y  is  the  join  (maximum  or  least  upper  bound) ,  having  the  unique 

properties 

(i)  x  a  y  <H  x,y  . 

(ii)  For  all  z  e  H  ,  if  2  x,y  -  then  z  sH  x  a  y  . 

( iii )  x,y  x  v  y  . 

(iv)  For  all  z  €  H  ,  if  x,y  z  ,  then  x  v  y  s  z  . 

It  follows  that  a  and  v  are  idempotent,  commutative,  asso¬ 
ciative  (and  hence  extendable  to  any  number  of  arguments  recursively 
unambiguously)  and  absorptive. 


(b) 

H 

has  a  greatest  element,  say, 

xa 

or 

th  (or 

simply  1 ) 

relative 

to 

and  a  distinct  least  element, 

say 

°H 

or  i„  (or 
n 

si 'r.ply 

0) 

relative  to  ,  with 

°H 

<  1H  .  x  a  0H  =  °H  -  X  v  0H  = 

X  , 

X  V 

=  1 

H  : 

all 

X 

€  H  . 

(c) 

H 

is  a  complete  lattice,  i.e. 

in 

addition 

to  being  a 

lattice , 

if 

C  £  H  ,  then  there  exist 

xc  • 

e  H 

such  that , 

abusing  notation 

(i) 

XC  C  *H  ' 

(  ii) 

If  Zj ,  z2,  €  H  and  z^  s  z 2  , 

,  then 

2i 5  xc  5  2 2  ' 


i.e.,  xc  and  y^  are  greatest  lower  bound  and  least  upper  bound 
elements,  respectively,  for  C  relative  to  H  .  (Of  course,  xc  , 
yc  *  C  ,  in  general . ) 
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(d)  H  has  the  Hey  t  i  ng -Brouwer  property  : 

For  any  u,v  e  H  ,  define 

Cuv  =  {z  |  z  e  H  &  u  a  z  SH  v}  . 

Then,  using  the  notation  from  (c)(ii),  let  y_  be  the  least 

Cu,  v 

upper  bound  element  in  H  for  Cu  .  It  is  then  assumed  that 
yc  €  Cu  v  '  *,e>'  Cu  v  ^as  a  maximal  element  in  it  with  respect 

to  order  . 

Define  also 


(u 


V)  3  y 


C  "  v*Cu,v> 
u,  V  ' 


called  the  pseudo-complement  of  u  relat'ive  to  v  or  the 
i ntu i t i on! st i c  Implication  "if  u  then  v  We  can  also  define 

the  double  implication  «=»  in  the  obvious  way: 

u  «=>  v  for  (u=>  v)  a  ( v  uj  . 

It  follows  that  H  is  also  distributive  for  a  over  v  and 
vice-versa  for  any  index  set.  In  addition 


(i) 

For  all 

x  «  H  , 

(X  r,  x) 

(ii) 

For  all 

x  €  H  , 

nt  (x)  = 

(-X)  =  (X  =» 

is  the  pseudo-complement  of  x  relative  to  0„  or  the  intuit ionist 

n 

negat i on  . 


Two  basic  examples  of  a  complete  Heyting  algebra  are: 


Example  1 .  Any  totally  ordered  complete  chain  H  such  as  a  closed 
interval  in  R  -  in  particular,  HQ  *  [0,1]  .  In  this  example,  for 

any  x,y  «  H  , 

f  Y  '  if  x  >h  y  {i-e-  *  *h 
*  y)  ■  l  *K  ■  1!  **«  y  • 

and  hence 

f  0„  ,  if  x  >u  0U  (including  x  =  1„) 
nt  (x)  =  (~x)  =  |  H  H  H  H  . 

[  1H  -  if  x  =H  0H  . 

Example  2.  Consider  any  Boolean  algebra  H  i.e.,  H  has  two  bi¬ 
nary  operations,  say  a,v  :  H  x  H  -•  H  and  a  unary  operation 
C  :  H  -«  H  such  that  v  and  a  are  symmetric,  associative, 


v 
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finitely  distributive  over  a  and  vice-versa  and  contains  0H  and 
1H  such  that  for  all  x  e  H  , 

x  a  0H  =  °H  :  x  v  °H  =  x  , 

X  A  1H  =  X  ;  X  V  1  =  1H  , 

x  a  cx  =  0H  ,  x  v  cx  =  1H  ;  0H  =  C1H  . 

Then  define  partial  order  s  by  for  any  x,y  e  H  , 

n 

(x  SH  y)  iff  x  a  y  =  x 

iff  x  v  y  =  y  . 

Also  define  =»  by,  for  any  x,y  e  H  , 

(x  »  y)  =  (Cx)  v  y  . 

Thus,  H  is  also  a  Heyting  algebra.  Also,  H  may  be  a  Boolean 
o-algebra,  i.e.,  H  is  also  o-complete  -  for  any  at  most  countable 
set  C  =  {Xj.Xj,...}  s  H  ,  xc  ,  and  equivalently,  yc  e  H  ,  and  hence 

H  is  a  o-complete  Keyting  algebra,  etc.  Usually,  H  is  interpreted 
as  a  collection  of  subsets  x  of  a  space  X  ,  where  the  null  set  0 
is  identified  with  0^  ,  X  with  1H  ,  n  with  a  ,  u  (or  symmetric 

set  difference  A)  with  v  ,  and  C  with  ordinary  set  complemen¬ 
tation  relative  to  X  . 

In  addition,  we  can  add  an  equivalence  relation  over  H  x  H 
together  with  two  other  associated  relations:  Let  e  :  H  x  H  -♦  H 
be  arbitrary  such  that  Rj  Th  lg  and  Rx  Th  ig  ,  of  Appendix  2, 

section  2.4  hold  for  0  replacing  *  there: 

8 (x,y)  =  0 (y,x) 

0 (x,y)  a  e (y,z)  s  0 ( x , z )  , 

for  all  x,y,x  «  H  .  Thus,  e.g.,  we  can  choose  e  =  a  or  0  =  a„  , 

H  H 

etc . 

Define 

D ( 0 )  :  H  -  H  by 

D(0)(x)  =  0 ( x , x )  ;  all  x  e  H  , 
and  finally,  in  turn,  define  eg  :  H  x  H  -►  H  by 

eg(x,y)  3  ( D  ( 0  )  ( x)  vH  D(0)(y))  =>  0(x,y)  , 

for  all  x,y  e  H  . 

Some  specific  examples  of  9  ,  D(0)  and  efl 

plete  chain: 


for  H  a  com- 
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(iii)  e(x,y)  =  8(H),  .  (ordinary  Kronecker) 

v  ,  y ; 

TH  iff  x  =H Y 

iH  iff  x  *Hy  ;  x,y  €  H 


e,(x,y)  =  a(H)(x#y)  «  all  x,y  €  h  . 

It  can  then  be  shown  that  eg  is  an  equivalence  (or  equality) 

relation  in  the  sense  that  Rj  Th  -  Rj  Th  21  of  Appendix  2, 

section  2.4  all  hold  with  *  replaced  by  eg  , 

pro  j  1  { ix  i )  =  (proj1(i)  (ixi)  =  xj*^)  by  x  ,  proj2(ixi)  by  y  , 

etc . 

For  the  basic  properties  of  0  ,  D(0)  and  efl  ,  couched  in 

terms  of  Higg's  topos  and  related  structures,  see  Appendix  3  (iv). 

Indeed,  all  of  the  axioms  and  rules  given  in  Appendix  2, 

section  2.4  for  a  ,  v  ,  9  ,  =  ,  etc.,  are  all  (generalized)  intui- 
tionistic  logic  ones  and  are  compatible  for  the  case  considered 
here  -  H  .  (See  also  2.2.2  (VI)  for  theorems  one  can  derive  for 
a  ,  v  ,  »  ,  *  ,  etc . ) 


4.  Probability  Logic (PL) 

Probability  logic  (PL)  is  a  non-truth  functional  logic  over 

l  . 

We  present  here  a  brief  outline  of  PL.  Additional  properties 
for  PL  and  related  logics  may  be  found  in  section  2.3.9  (A). 

In  this  case,  we  choose  H  =  £0,1]  (unit  interval)  as  the 

truth  space. 

For  each  i  ,  let  Xi  e  Ob(SET)  and  11  in  £  ^(X^)  be  a  sigma 

algebra.  For  each  (f  :  i  -*  j)  «  Var(Ar (£))  ,  u  f  11  :  11  ill  -*  11  jii  and 

for  each  r^  «  Var(Rel(i))  ,  with  o(r^)  =  i  ,  let  iir^i  e  [0,l]nil1 

be  such  that  nr^n  is  a  probability  measure  and  hence  for  any  indi- 
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vidual  variable  assignment  function  w  and,  say,  7  =  r^[x^]  , 

iiipii^  =  HrJl(Aw)  e  CO/1]  ,  where  Aw  =  w(x^)  «  n  ill  ,  etc. 

As  usual,  for  any  a,p  e  Typ(i)  ,  and 
(F  :  a  -*  J3)  €  FuncSymb ( L )  ,  IIFU  :  Hall  -*  H/3II  ,  with  Hall  =  HVar(a)ii  , 

etc . 

In  particular,  consider  the  logical  and  quantitative  aspects  of 
ii  ii  ,  noting  here  we  let 

Rel(c.a)  =  u  CO,!]"1"  , 

i€Var(0b(i ) ) 

Until  :  Rel(c,»)  Rel(e,»)  , 
so  that  for  any  7  e  Wff^j(Z) 

ii nt (y ) ii  =  until  (inpii )  =  l  -  impii  e  [o.l]111"  . 

Hence,  for  any  A  e  iiin  ,  denoting  CA  =  n  A  , 

ii  nt  (if )  n  (A)  =  Until  (ill’ll )  (A) 

=  1  -  iiv-ii  (A) 

=  impii  (  ca) 

Next,  consider 

x  :  Var (Rel (  / )  )  x  Var(RelU))  -  Var(Rel(£))  , 

where  for  any  ri*sj  €  Var { Rel (£))  ,  it  Is  assumed 
ri  x  sj  €  Var(Rel(i))  with  o(r^  x  s^ )  =  i  x  j  .  Then 

ii  x  n  :  Rel  (c  ,»)  x  Rel  (c  ,a )  Rel(C,a) 

is  defined  so  that  for  any  7,9  e  Wff(i)  ,  HiPxeii  =  (iiipii  iixii  lien)  is 
some  choice  for  a  joint  probability  measure,  given  the  marginal  pro¬ 
bability  measures  iir.ii  and  ns.ii  . 

i  3 

Also  define 

Util  :  Rel(C,»)  x  Rel  (C  ,»)  Rel(C,»)  , 
so  that  for  all  7,9  e  Wff(t)  , 

ii  ip  1 8  ii  =  ii?  ii  ii  t  ii  lien 

=  impii  +  lieu  -  iivh  ii x ii  lieu  , 

noting,  because  of  the  range  of  H  n  being  in  [0,1]  ,  all  arith¬ 

metic  operations  as  above  are  well-defined. 

In  particular,  define 
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II &II  .Horn  :  Rel(c,»)  x  Rel(c,»)  Rel(C,!R)  , 
for  all  y,0  «  wff(i)(^)  '  and  a11  A  €  11111  ' 

ii y  &  011  (A)  =  ( ii ipii  ii&ii  ii0ii  )  (A) 

^  (iiyil  iixii  ii  0  H  )  { Ax  A )  . 

For  any  y,0  €  Wff^j(A)  , 

iiy  or  0ii(A)  =  (iiyil  Horn  H0H)(A) 

=  ( II  ip’ll  II  til  II 0  H  )  ( AxA ) 

=  Hip'll  (A)  +  11011(A)  -  (liyii  ii&ll  II 0 ll  )  ( A )  . 

It  also  easily  follows  that  for  any 
y  «  Wff(  (t)  ,  0  €  wff  (t.)  ,  any  A  «  ii  ill  ,  B  €  iijn  , 

(  ii  y  ii  ii  x  ii  ii  0  u  )  ( At  B )  =  {  ii  y  ii  ii  t  ii  ii  0  ii  )  ( Ax  B )  , 

where  A  t  B  =  (AxX^)  u  (X^xB)  ,  etc. 

In  addition,  one  defines  for  any  r,8  e  Wff^j(i)  , 

uy  zt  on  =  ( ii y ii  ii=»ii  ii 0ii  ) 

=  ii  (  (nty )  or  0  )  ii 

=  iintyn  Horn  ll 011  , 

iiy  «=»  0ii  =  ii  (y  =►  0)  &  (o  »  y)u 

=  (  ii  y  ii  ii  ^  ii  ii  a  ii  )  ii&ii  (  ii  0  ii  ii  =»  ii  n  y  ii  )  , 

and  the  quantifiers  are  evaluated  as  for  any  y  e  Wff^x..j(£)  , 

,  as  for  classical  logic 

ix  j 

ll  (Vx^  )ynw  =  inf  {Iiy»ul  I  all  w-  €  G  (x(  1 1  ,w  )  (y ) } 

11  (3xix j)V"w  "  sup  {"yllu.  I  a11  w’  €  G(x(1)  ,u>)  (y)  ) 

where 

( d. )  d 

G(x  ,w)  =  (u  •  |  ,i  •  individual  variable  assignment  such  that 

U  (y)  ;  ye  TV(y)  H  ( x  ^  1 
.  arb  ;  y  =  x^ 1 ^ 


w  •  (y) 
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It  can  be  shown  that  for  any  €  > 

IMP  1 II  =  IIV^II 

iff  7  and  r  are  equivalent  for  L 2  . 

Finally,  note  that  PL  is  not  truth-functional,  since  there  is 

2 

no  function  f  :  [0,1]  -*  [0,1]  such  that  for  all  individual 

variable  assignment  functions  u  ,  and  all  7,0  e  Wff(z) 

117X011  =  ( II  711  II  x:  II  II  0  II  ) 

o  '  '  o 

=  f„  .,(11711  ,  II 011  ) 

II  xll  '  w  to ' 

or  equivalently,  for  all  A  x  B  e  ilixjli  , 

117x011  (AxB)  =  f||x||  (II7II  (A)  ,  11011(B))  . 

In  particular,  there  is  no  g||&||  :  [0,1]  -*  [0,1]  such  that  for 
all  7,0  and  o  , 

II7&0II  =  (II 711  ll&ll  II 011) 

to  '  'to 

=  g,i&.,(|iyl,U'  11 0,1  J  • 

See  section  2.3.9  for  connected  results. 

5.  Standard  Sequence  Logics.  (See  Maydole  [172],  p.  130-132.) 

6.  Modal  Logic.  (Rescher  [214],  pp.  188-197). 

Essentially,  modal  logic  is  characterized  by  a  collection  of 
unary  logical  connectives,  for  example,  the  three  alethic  modal 
operators:  Necessity,  Possibility,  and  Actuality/Assertion.  The 

corresponding  logic  can  be  developed  by  specifying  unary  truth- 
functional  tables.  In  addition,  of  course,  other  logical  con¬ 
nectives  are  used  such  as  not,  and,  or,  =»  ,  etc.,  and  in  turn, 
compound  or  eliminative  logical  connectives  can  be  defined  in  turns 
of  these  connectives.  Also,  theories  have  been  established  char¬ 
acterizing  modal  logic.  All  these  logics  are  special  cases  of 
multi-valued  logics.  (See  also  [245].) 

7.  Generalization  of  truth  functional  logics  using  negation  opera¬ 
tors,  t-norms.  t-conorns.  (See  section  2.3.4  for  a  comprehensive 
presentation. ) 


2.3.4  Set  theory  for  ■ultl-valued  logic 


(a)  Introduction 


All  mathematical  concepts  and  reasoning  may  be  based  upon  some 
form  of  set  theory.  (However,  see,  e.g.,  Von  Neumann  [257]  for  a 
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primitive  function  approach,  Lake  [149]  for  a  related  procedure,  or 
Gilmore  [74],  for  a  symbolic  operator  approach  replacing  set 
theory.)  In  the  past,  set  theory  has  been  mainly  confined  to  the 
scope  of  classical  two-valued  logic.  In  modeling  problems  of  the 
real  world,  two  directions  present  themselves.  First,  one  can 
attempt  to  make  concepts  -  especially  those  couched  in  linguistic 
terms  -  conform  more  narrowly  to  classical  0-1  logic  by 
"mathematizing"  vague  concepts.  For  example,  "some  apples  are  bad 
in  that  bunch,"  can  be  made  more  precise  by  perhaps  artificially 
specifying  "three-sevenths  of  the  apples  are  bad  in  that  bunch" . 
(But  is  three-sevenths  exact?)  On  the  other  hand  -  the  view  taken 
in  this  monograph  -  is  to  keep  the  ambiguity  or  vagueness  of  such 
descriptions,  but  consider  the  corresponding  truth  evaluation  not 
necessary  restricted  to  T  (or  1)  and  i  (or  0)  ,  but  rather  to 
lie  within  the  unit  interval.  Correspondingly,  set  theories  for  a 
given  natural  language,  and,  in  turn,  using  the  Parsing  Principle, 
for  a  related  formal  language,  may  be  established  also  within  a 
multi-valued  truth  setting.  An  application  of  this  is  the  estab¬ 
lishment  of  knowledge-based  systems  from  both  experts  and  analytic 
considerations ,  from  a  multiple  truth  viewpoint.  (See  Chapter  8). 

In  the  following  development,  L  is  any  fixed  formal  language 
and  L  =  Logic(£)  is  some  fixed  logic,  unless  otherwise  stated. 
Initially,  a  list  of  the  most  common  axioms  used  to  develop  set 
theories  will  be  presented,  followed  by  a  list  of  compatible  logical 
axioms  needed  to  develop  the  set  theories  for  various  logics.  Then, 
in  addition,  deduction  rules  are  added  to  the  various  axiom  sets  to 
complete  the  theories,  eight  of  which  are  summarized  below.  Final¬ 
ly,  a  brief  survey  of  basic  relations,  properties  and  problems  of 
such  set  theories  is  presented. 


(b)  Selected  set  theories 


The  following  list  contains  (by  no  means  exhaustive)  33  of  the 
most  common  set  theory  axioms  used  to  form  various  theories  of  sets. 
Only  the  first  three  axioms  will  also  be  presented  in  some  detail. 

For  symbolizations  of  the  remainder  and  further  details,  see, 
e.g.,  Maydole  [172],  Cohen  [39],  Vopenka  and  Hajek  [258],  MacLane 
[165],  Fraenkel  and  Bar-HIllel  [62],  Devlin  [46''],  Cohen  and  Hersh 
[40],  Suppes  [249],  Pareigis  [202]  (Appendix),  and  the  discussions 
of  Cohen,  Hajek  and  MacLane  in  [4'].  All  of  the  above,  except 
Maydole,  refer  to  classical  0-1  logic. 

Before  presenting  the  axioms,  four  distinguished  binary 

function  symbols  in  Foun(L)  are  reintroduced  from  section  2.2.2: 

( set )  abstraction  {•[••)  :  0b(£)  x  0b(£)  x  Rel(x)  -*  Ar  (£) 

{ set )  membership  e  :  0b(£)  -»  Rel(i) 

( set )  exponentiation  n  :  0b(£)  -»  0b (£) 

( set )  equivalence  *  :  0b(£j  -♦  Rel(£)  . 

Note  that  all  of  the  axioms  presented  below  are  in  the  form 
(HP)  ,  for  some  r  e  Wff(i)  .  (See  also  section  2.3.3  (a)  (viii) 

for  a  discussion  of  the  different  ways  axioms  may  be  presented.)  In 
addition,  only  e  and  *  appear  explicitly  in  the  axioms,  while 
the  class  abstract  operator  {•!*•>  Is  avoided  by  use  of  the  con¬ 
textual  definition  approach  via 


<a|!P(a)}  for  (3/J)(Va)((a efl)  <=>  iP(a)) 
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where  a  e  Wfv.(i)  ,  B  e  Wfv  .  ( i )  ,  and  r  e  Wff.(i)  ,  for  the 
1  a1  1 

various  concepts  -  such  as  unions,  intersections,  ordered  pairs, 
etc.  -  produced  by  the  set  theory  axioms.  (See  the  discussion  in 
[172],  p.  70-76.)  Furthermore,  (compatible  with  the  comments  in 
[172],  p.  248)  we  can  evaluate 

n  (a  |y  (a)  >ii  =  (ii  ill  ,  iivii )  , 

where  imi  :  Mill  -»  H  ,  for  example. 

The  basic  table  of  axioms  is  as  'ollows: 

Axiom  Un  Universal 

(3«)  (V/5)  (  (pea)  <=>  p  *  p))  , 

where  e  and  *  are  abbreviated  symbols  and  p  e  Wfv^i)  ,  for 

some  i  e  Var(0b(£))  .  Individual  variables  here,  a  ,  p  ,  i  ,  .... 

are  considered  sets  in  a  general  sense. 

Axiom  21 (El)  Extension 

I-  (Va  )  (V/3  )  (Vj  )  (  (y  eac=»iep)^a«p)  . 

Many  modifications  of  this  axiom  exist.  (See  Maydole  [171],  p. 
66,  60,  63,  214,  222,  223.) 

Axiom  GC  Generalized  Comprehension 

(Va1)-  ■  (Van)  (3a)  (V/5)  (p  €  a  «=»  rlp,a,a1 . aR)  )  , 


for  all  tp  e  Wff{4)  . 

Thus  formally,  any  attribute  y  yields  a  corresponding 
generalized  set  a  .  Many  variations  or  weakenings  of  GC  exist, 
including  Abstraction  (see  Maydole  [172],  pp.  65,  et  passim).  (See 
also  subsection  (d)  for  further  discussion. ) 

Also,  note  the  following  axioms: 


Abbreviation 


Axiom  Common  Name 


22 

23 

24 

25 

26 

27 

28 
29 
210 
211 
212 

213 

214 

215 

216 

217 

218 

219 

220 


Null  set 

Unordered  pairs  (union  of  pairs) 

Arbitrary  union 

Infinity 

Replacement 

Power  set 

Regularity  (Russell  Paradox  exclusion) 
Choice 

Continuum  Hypothesis 
Ordered  pairs 
(Relative)  Complement 
Cartesian  product 
Membership  relation 
Domain 

Permutation  (conversion) 

Foundations 

Restricted  Comprehension 

Transitivity 

Mostowski 
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221 

Singletons 

Z22 

Intersection 

Z23 

Existence 

Z24 

Restriction 

Z25 

Range 

Z26 

Relations 

Z27 

Nontriviality 

Z28 

Functionality 

Z29 

Double  replacement 

Z30 

Separation 

Z31 

e  -  induction 

Z32 

Collection 

Z33 

Double  complement 

(A)  A  selected  list  of  logical  axioms  used  in  developing  various 
set  theories 


For  all  e,  r ,  rj  e  Wff(t)  ;  all  a  e  Var(£)  ,  -*  an  Impli¬ 

cation  operator,  etc.: 

Abbreviation  Axiom  Name 


A1 

Deduction 

9  z»  (y  *  g )  . 

A2 

Distribution  of  implication  over  implication 

y- 

(9  a  (y  =»  17))  ■»  ({9  =»  y )  =»  (9  ^  77))  . 

A3 

Modus  Tollens 

V- 

(nt  y  »  nt  8)  j  (9  =,  y )  . 

A4 

Distribution  of  Universal  Quantification 
Implication 

over 

( Va  )  {  (  ( 9  *  y  (x)  )  =»  (9  =,  (  (vx)y(x)  )  )  )  , 

a  bound  in  8  . 

A5 

Universal  instantiation 

h 

( (va)y)  =»  9  , 

where  9  is  like  y  with  a  replaced  by  /s 

,  etc 

A6 

Equivalence 

h 

( Va ) (a  x  a  )  . 

A7 

Substitution 

t- 

( Va  )  ( V/5 )  (  (a  -/»)=,  (8(a)  «  8  (/>)))  , 
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where  a  (at)  is  like  9  (p)  with  p  substituted  for 
a  whenever  a  is  free. 

See  Maydole  [172],  p.  50,  et  passim.  Hay's  Logic  Axioms  (HI) 
-  ( H9 ) ( ( HI )  =  (Al)  ,  ( H2 )  =  ( A2 ) (modified)  ,  (H4)  =  (A3))  are 
discussed  in  Maydole  [172],  p.  225,  et  passim,  including  various 
properties  for  Logic(t)  =  that  hold  when  (HI)  -  (H9)  are 

assumed . 

Next,  two  inference  rules  are  listed  below  which  are  used  in 
conjunction  with  various  collections  of  set  theory  and  logical 
axioms  to  form  set  theories. 


(B)  Basic  inference  rules  use  in  set  theories 

Abbreviation  Inference  Rule 


R1  Modus  Ponens 

If  h(e  =»  T)  and  t-  9  then  l-  *  . 


R2  Universal  Instantiation 

If  v-  9  ,  then  t- (Va ) e  . 


(C)  Some  basic  set  theories 

To  each  of  the  following  eight  axiom  collections,  A x(£)  ,  may 

be  added  Rul(t)  =  { ( R1 ) , { R2 ) )  (with  some  redundancy  possible)  to 
form  eight  corresponding  set  theories  Thx(£)  >  where 

x  =  NST ,  MH,  NGB,  ZF,  IZF,  Ch,  Wei,  Nov. 

1.  NST  -  Naive  Set  Theories  ([104],  [172]) 

a.  (Al)  -  (A5)  ,  (Zl)  ,  (Z9)  ,  (GC)  , 

b.  (Al)  -  ( A7 )  ,  (Zl)  ,  ( Z9 )  ,  (GC)  with  an 

identity  assignment  (valuation)  constraint. 

2.  MH  -  Mavdole-Hay  Set  Theories  ( T 1 7 2  ]  ,  Chapter  6). 

a.  ( A6 )  ,  (HI)  -  ( H9 )  ,  (Zl) (modified) (GC)  , 

b.  (HI)  -  ( H9 )  ,  (Zl) (modified) (GC)  with  an 

identity  assignment  constraint. 

3 .  NGB  Von  Neumann-Godel-Bernays  Set  Theory  [ 4 ' ] . 

(Zl)  -  ( Z8 )  ,  ( Zl 2 )  ,  ( Zl 4 )  ,  ( Zl 6 )  ,  (Z22)  - 

(Z25)  ;  with  some  modifications. 

4.  ZF  Zermelo-Fraenkel  Set  Theory  [249]  .  [4'],  [62]. 

(Zl)  -  ( Z8 ) 

Optionally,  add  (Z9)  and/or  (Z10)  . 
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IZF  Intuitionistic  Zermelo-Fraenkel  Set  Theory  [9?‘i. 

a.  Grayson  (Zl)  ,  <Z3)  -  (Z7)  ,  (Z30)  -  (731)  , 

b.  Grayson  (Zl)  ,  (Z3)  -  (Z7)  ,  (Z30)  -  (Z32)  , 

c.  Powell  (Zl)  ,  ( Z3 )  -  (Z7)  ,  (Z30)  ,  (Z31)  ,  (Z3i) 

CH  Chapin's  Set  Theory  (T321,  [33]) 

Here,  e  Is  a  trinary  predicate  symbol,  between 
an  element,  a  set,  and  degree  of  membership.  The 
following  axioms  are  thus  appropriately  modified: 

(Zl)  -  ( Z3 )  ,  ( Z5 )  -  (Z9)  ,  (Zll)  ,  ( Z 1 3 )  , 

(Z26)  ,  (Z27)  ;  presented  within  14  axioms. 

Wei  Weidner's  Set  Theory  [270]. 

Similar  restrictions  hold  for  e  as  in  Chapin's 
Theory : 

(Zl)  ,  ( Z3 )  ,  ( Z5 )  -  (Z8)  ,  ( Zl 1 )  ,  (Z2C)  .  (Z29)  , 

appropriately  modified. 

Nov  Novak's  Set  Theory  f!95]. 


Again,  <=  is  as  in  Chapin's  Theory: 

(Zl)  ,  ( Z3 )  ,  ( Z7 )  ,  ( Zll )  ,  ( Z12 )  ,  (Z14)  ,  (Z15) 

( Zl 6 )  ,  ( Z18 )  ,  (Z23)  ,  (Z25)  ,  Un  . 


,  (Z22)  , 

77)  . 

(2)  For  Maydole-Hay ' s  theory  relative  to 

Logic(z)  =  :  all  ZF  axioms,  except  possibly 

for  ( Z 5 )  are  theorems  in  .  ([172],  p.  241). 

(3)  (i)  Zermelo-Fraenkel  Theory  (Zl)  -  (Z8)  is  con¬ 

sistent  if.f_  NGB  is  consistent. 

(ii)  It  is  not  known  whether  ZF  is  consistent 
or  not. 

(iii)  Essentially  all  ( set-theoretic )  theorems  of 
ZF  and  NGB  coincide. 

(iv)  If  ZF  is  consistent,  then  it  is  not  com¬ 
plete  . 

(v)  ( Z 9 )  and  (Z10)  are  consistent  with,  and 

independent  of,  .(ZF),  i.e.,  there  are  various 
models  for  (ZF)  which  make  ( Z9 )  ,  (Z10)  have 

any  combination  of  truth  or  falsity  values 
(P.  Cohen,  1963:  [4'],  [39],  [40]). 


(c)  Basic  relations  among  set  theories 

(1)  For  Naive  Set  theory  relative  to  Logic (t)  = 
( Z  2 )  -  (Z5)  ,  (Z7)  ,  ( Z9 )  ,  ( Zl 2 )  ,  (Z21) 

are  all  theorem  in  (Maydole,  [172],  p. 
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(vi)  <(Z1)  -  (Z3)  ,  { Z7 )  ,  (Zll)  ,  (Z12)  -  (Z16)} 
is  equivalent  deductively  to  {(Zl)  -  (Z4)  , 

( Z7 )  ,  (Zll)  ,  ( Zl 8 )  )  .  (See  Johnstone  [127], 

p.  312-319.) 

(4)  If  ZF  is  consistent  then  Weidner's  Theory  is  con¬ 
sistent  (Weidner  [270],  Theorem  11).  Weidner's 
Theory  extends  ZF  where  the  predicate  membership 
symbol  is  trinary,  accounting  for  membership  levels. 

(5)  Maydole  shows  ([172],  Chapter  I,  II): 

(i)  For  any  NST  for  L2  GC  yields  Cantor's 

paradox  and  hence  NST  is  model-inconsistent 
(for  i_2)  • 

(ii)  For  any  formal  language  t  and  corresponding 
Logic(£)  ,  if  the  following  inference  rules 
involving  evaluations  can  be  shown  to  hold: 


Existential  instantiation  (El) 

Universal  instantiation  (UI) 

Material  equivalence  (ME) 

Simplification  (S) 

Absorption  of  some  order  (A) 

Modus  Ponens  (MP)  , 


then  any  theory  Th(Jt)  containing  also  (GC)  has 
a  Curry-like  paradox  ([172],  p.  121  et  passim), 
and  hence  (GC)  is  model-inconsistent  for 
Logic (4)  .  More  explicitly,  universal  and 
existential  instantiation  may  be  written  as: 

Suppose  II  ii  :  l  -  ( SET ,  H  )  with 
(0,1)  £  H  S  [0,1]  . 


Universal  Instantiation  { UI ) : 

For  any  i  s  V£r(0b(£))  ,  ary  x  e  Wfv^(£) 

and  any  f  e  Wff(i)  : 

If 

H  (vx)?  (x)  ii  =  1 

then  for  all  t  «  =  »iii  «  Ob(SET) 

inf’ll  ( t)  =  1  . 

Existential  Instantiation  ( El ) : 

For  any  i  «  Var(0b{£))  ,  any  x  e  Wfv^(r) 

and  any  r  e  Wff(i)  ; 

If 

ii  ( 3 x ) v  (x)n  -  1 

then  there  is  a  t  e  ;  such  that 


liv’d  ( t)  =  1 
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(iii)  If  Logic(t)  has  a  finite  semantic  truth 

value  range,  universal  and  existential  instan¬ 
tiation  always  are  valid  (provided  (HI)  ■ 

( R2 )  are  inference  rules,  as  usual)  in  any 
theory  Th(t)  . 

(iv)  If  for  any  formal  language  t  and  corres¬ 
ponding  Logic(i)  ,  the  six  inference  rules 
hold  in  (ii)  in  a  weakened  "quasi-"  form  (by 
modifying  for  various  truth  levels  -  see  [172], 
pp.  123,  124),  then  a  Curry-like  paradox  still 
holds  for  any  Th (£)  when  (GC)  is  incluaed; 
and  hence,  (GC)  is  model  inconsistent  for 
Logic (4)  . 

More  specif ically ,  to  illustrate  the 
"quasi"  form,  (QUI)  and  (QEI)  may  be 
written  as: 

Quasi -universal  Instantiation  ( QUI ) : 

For  any  A  e  [0,1]  and  any 
i  e  Var(0b(£))  and  any  x  e  Wfv^t)  ,  and  any 

T  e  Wf f (/. )  : 

If 

II  ( Vx }  V  ( X)  II  k  A  , 

then  for  all  t  «  =  Hill  e  Ob(SET)  . 

H  Vll  ( t)  k  A  . 

Quasi -existential  Instantiation  ( QEI ) : 

For  any  i  «  Var(0b(t))  and  any  x  e  Wfv^(£) 

and  any  V  e  Wff (t)  : 

If 

II  (3x)V  (  X)  II  =  1 

then  for  each  integer  m  k  1  ,  there  is  a 
corresponding  tm  «  X^  such  that 

HVH ( t  )  k  l  -  l/m  . 
m 

(v)  As  a  consequence  of  (ii)  or  (iii)  or  (iv),  Maydole 
shows  that,  except  for  t.,  ,  for  all  other  logics 

1 

considered  ( ,  2  £  k  <  ,  Bochvar,  Prob.  Logic, 

Godel-Logic,  etc.)  when  GC  is  added  to  any  Th(£), 
a  Curry-like  paradox  results  and  hence  except  for 
t_N  ,  Th(i)  ,  when  GC  is  included  in  its  axioms, 

is  mode  I - i neon s i 3 t ent .  (See  [172],  [172'].) 

(a)  We  do  not  know  whether  Th(£)  with  GC  is 
model  consistent  for 

*1 
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(b)  Since  is  not  axiomitizable ,  no  Th(£) 

(including  any  set  theory)  can  be  both  logically 
sound  and  logically  complete. 

(c)  Any  Th(£)  including  both  GC  and  (El)  is 

model-inconsistent  for  with  an  identity 

assignment  construct  ([172],  p.  203). 

(d)  By  suitably  restricting  GC  ([172],  pp .  151-171), 
MH  Set  Theories  (which  all  contain  GC)  are  model- 
consistent  for 

H1 

However,  none  of  these  theories  can  be  both 
logically  sound  and  complete.  Indeed  they  all  are 
deductively  incomplete,  logically  incomplete,  but 
logically  sound.  (See  Maydole's  discussions, 
pp.  236-245.)  On  the  other  hand,  MH  retains  some 
form  of  (GC)  and  is  a  plausible  candidate  for  set 
theory.  (See  [172],  Chapter  V.)  . 

(6)  Chapin's  [32],  [33]  theory  extends  ZF  ;  some  rudemen- 

tary  arithmetic  and  set  theory  relations  are  develooed. 

The  membership  predicate  symbol  is  considered  tr inary, 
hence,  involving  set  membership  level. 

(7)  Novak's  theory  extends  NGB.  Novak's  membership  predi¬ 
cate  symbol  is  trinary  analogous  to  Weidner  and  Chapin. 

Novak,  in  a  different  direction  [195"]  from  the 
above  approach,  uses  the  concept  of  semisets  as  originally 
expounded  by  Vopenka  and  Hajek  [258],  as  the  basis  for  a 
generalized,  and  in  effect,  multiple-valued  logical  set 
theory.  (Roughly  speaking,  a  semiset  =  {x  |  y(x)  s  a) 

where  v  ,-s  a  generalized  (non-set  one)  property  or  pre¬ 
dicate  and  a  is  an  ordinary  set.)  Essentially,  Novak 
first  shows  that  all  semisets  are  monotone  approximable  by 
nested  sequences  of  ordinary  sets,  i.e.,  by  flou  classes 
and  hence  in  turn  by  fuzzy  sets.  (See  section  5.2  (B)  for 
basic  properties  and  background.)  A  number  of  points  of 
difficulty  arise  in  both  the  structure  of  the  approximat¬ 
ing  collections  of  sets  and  set-like  operations  on  them  - 
in  this  approach.  (Compare  with  the  situation  for  elemen¬ 
tary  flou  class  representation  of  fuzzy  sets  by  ordinary 
sets  as,  e.g.,  in  Theorem  4,  section  5.2  (B) . )  However, 
the  approach  does  appear  promising  since  the  entire  formu¬ 
lation  is  carried  out  completely  within  an  axiom-like 
framework,  as  opposed  to  the  usual  situation,  where  fuzzy 
set  membership  functions  are  ideally  exact  numerical 
functions,  estimated  through  empirical  and  psychological 
considerations.  On  the  other  hand,  it  seems  that  Novak 
can  still  not  completely  avoid,  somewhere  along  the  line, 
empirical  subsets  in  the  modeling  of  the  original  wffs 
or  properties  generating  the  semisets  of  interest. 

(8)  For  GOdel-Intuitionistlc  logic,  it  can  be  shown  (see 
Grayson  [ 97 ' ] ) : 

(ZF)  Is  consistent  relative  to  ( IZF ) ( b ) 

(ZF)  Is  consistent  relative  to  (IZF)(c)  . 
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(9)  In  a  related  vein,  it  can  be  shown  that  ([179]) 

(21)  -  (27)  ((26)  (weakened))  are  valid  for  Godel  Logic 

relative  to  Higg(H)  . 

Other  approaches  to  set  theory  tnrough  multi-valued  logic 
include:  Allen  [2‘],  using  the  concept  of  "property  systems"; 

Chang's  infinite-valued  logical  set  theory  [31];  Smith's  "textured 
sets",  a  concept  using  weighted  sums  for  modeling  membership-like 
functions  [244'].  See  also  Klaua's  series  of  papers  in  the  honat  . 
Deutsche  At cad.  Vi ss  .  ,  Berlin,  on  a  set  theory  based  on  multi-valued 
logic,  analagous  to  -  and,  originally,  independent  of  -  2adeh's 
work,  beginning  with  [139]  and  [138'].  Approaches  to  set  theories 
and  properties  of  generalized  sets  may  be  carried  out  through  mem¬ 
bership  functions  only  (unlike  Ch,  Wei,  Nov). 

In  another  direction,  Willmott  [274]  has  investigated  various 
ways  of  defining  norms  and  distances  between  fuzzy  sets  which  Warren 
[266]  considered  the  equivalence  relation  of  shape  (up  to  scalar 
multiples)  of  fuzzy  set  membership  functions.  Kloeden  [143]  inves¬ 
tigated  the  topological  structure  (through  a  Hausdorff  metric 

X 

distance)  of  the  space  of  membership  functions  [0,1] 

See  also  section  2.4.2  (G)  for  a  discussion  of  the  connections 
between  various  set  theories  and  topoi . 


( d )  The  axiom  of  comprehension/abstraction  revisited 


Finally,  let  us  tie-in  the  basic  approach  to  set  comprehension 
or  abstraction  through  formal  language  L  where  some  or  all  of  ap¬ 


pended  theory  Th. 


(and  the  appropriate  part  of  Th 


syn, 


U)> 


is 


present,  with  the  set  theory  axioms  (and  consequently,  sat  theories) 
discussed  earlier.  (Again,  the  basic  reference  is  [172],  pp.  65-86, 
246-254 . ) 

Let  l  be  a  formal  language  as  presented  in  2.2.2,  but  with 
Th^fl)  to  be  specified. 


First  note  that  for  any  i,m  e  Var(0b(i))  ,  with  m  2  i  x  fl1  , 
i.e.,  m  =  i  x  n1  x  k  ,  etc.,  for  some  k  e  Var(0b(l))  (possibly 
vacuous  so  that  m  =  i  x  n1)  , 


V  a  lx'1'  «.  y'”1') 
m  i  Jm 


=  e ,[<x(l) ,y(R  *>] 
l L  m  'm  1 

e  Wff(l)  , 


with 


o  (ip  )  =  m 


Now  consider  the  converse 
FV(y)  =  x^^  x  z^-^  ,  without 

i  ,  j  «  Var  ( Ob ( I )  )  ,  x(i),z<;!) 


problem:  Given  any  f  e  Wff (L)  with 

loss  of  generality,  for  some 
e  Wfv(i)  ,  and  writing,  w.l.o.g.  in 
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substitution  form 


y  =  yfx^  ]  , 

does  it  follow  that  the  following  holds,  connecting  dummy  variable 
and  individual  variable  notation: 


(GC) 


(V2(j))(3y 


l"1'|[v>ll,MU111 


{n1 ) . 
y  ) 


y  [x 


U) 


z( j)]) 


Generalized  Comprehension  or  Abstraction? 

Note  first  axioms  ^  22  and  23  put  in  individual 

variable  form  imply  the  desirable  property 


(GC1  ) 


(x 


(i) 


{x 


(i) 


y  [x 


U) 


,z(j)]>) 


y  [x 


( i : 


which  in  turn,  because  of  the  obvious  fact  that 
a  t-  h  b 


implies  always 


(a  «=»  b)  , 

we  then  have  the  weaker  form 

(GC")  ^((vx^'lix111  «i  {x(i)  |  y  tx(  ,z(  ^  }  ] } ) 


y  [x 


:d 


,z(J)])) 


the  theorem  of  set  compatibility. 

But  clearly  by  straightforward  use  of  3  ,  and  in  effect 

letting 


y(«  )  „  {XU)  |  y  [X(  ,z(  ] )  , 

(GC")  implies  (GC)  . 

Also,  R-r  implies  the  basic  substitution  form  for 

I,syn,35 

(  f  :  t  j  )  e  Var  ( Ar  {i )  )  , 

y-L  (<x(1)  |  y  [x(  1 }  ,z(  3’ }  ] )  <»f  *  <x(i)  |  y[x(1),z(j)  of]))  , 

etc.  Furthermore,  if  n  ii  :  t  -*  (c,x)  is  a  model,  then  for 
o(y )  =  i  x  j  ,  w.l.o.g.  , 


<x^  |  y  [x^  .zf^  ]  }  e  Ar  ^  ( Var  ( £  )  )  S  Var(ArU)) 

j,n 


and 


ii  (x 


(i) 


y  [x 


(i) 


m 


m  J 


ii  ju 


IIRill 


(See  2.3.5  for  further  details.) 
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In  addition,  note  that,  e.g.,  axioms  (Z12)  ,  complements, 
(Z22)  ,  intersections,  and  (Z4)  ,  unions  ,  all  yield  set  compatible 
forms.  For  example, 


(Z12)  i-(Yy(n  !)(3z,G  }  )  (Vx!i!  )  (  (x(i)  e.  z(n  ) 


nt(xU)  e.  y(nl))) 


yields 

( Z12"  )  ((vy(n  ')(3z'G  5  )  (vx(  1 !  )  (  (x(  1 1  e.  z(n  !) 

x(1>  .l(x(11  j  ntlx*11  .jV1"1’)))) 


and  (Z22)  specialized  to  binary  intersections 


( Z22  1  ) 


•“z((vy(n  hivw'0  ’liaz*15  J)(vx(l)) 


(x<1}  e  2^>)  ~  ((x(i,.1y(al,)«(x<1,«1  w^1'))) 


implies 


( Z22 " ) 


►-r((vy(fi  ))(vw<a  })(3z(R  )  )  (vx(l)  )  (  (x!l)  e  2(fi  >)  «=> 
(x(1)  €  {x(1)  |  (x(i)  y(n  })  &  (x(l)  «i  w(n  )))))  . 


Thus  we  make  the  new  compound  definitions  for  the  arrows 


cy(a  >  A  (x^ 


nt(x(i)  e .  y(R  1 )}  , 


y<“  >  n  w<n  >  S  (X(D  ,  ,*<*>  ^  y<“  >  a  (x(i)  e.  w<“  »  ,  }  , 

J  ix(1>  |  (x'1’  .j  y(ni>)  or  (x(1>  «.  w'0*’))  , 
and  more  generally,  for  any  combination  of  c  ,  n  ,  u  , 


comb ( C ,n,u) (y|“} ' . y|”j ) ) 

=  (x(1)  |  comb(  C  ,n  ,u  )  (  (x{  1 J  e  y|” ,  1  ),...,  (x(  1  *  e  yj^*))  , 


:U)  €  ( «a) 

i  Y(U 


i  Y(m) 


with  similar  definitions  holding  for  multiple  arguments  (replacing 
single  x^^)  and  cartesian  products  and  sums. 

For  related  results,  see  section  2.3.5  (b)  . 


Many  difficulties  inherent  in  developing  the  basis  for  a  multi¬ 
valued  set  theory  are  discussed  in  Chapter  VI,  [172]  (especially, 
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pp.  236-254).  Many  competing  set  theories  abound.  The  positive 
results  for  Maydole’s  advanced  set  theory  relative  to  are 

encouraging.  However,  set  theories  for  other  logics  such  as  the 
general  truth  functional  one  generated  by  triples  of  negations, 
t-norms,  t-conorms  (see  section  2.3.5),  may  also  be  worthy,  although 
for  such  logics,  it  may  be  difficult  to  show  any  soundness  or  com¬ 
pleteness  properties,  or  similarly,  it  may  be  difficult  to  show 
inconsistencies  are  not  plausible. 


2.3.5  General  logical  systems  and  dispersions. 

(a)  Discussion. 

In  addressing  the  problem  of  determining  the  state  or  states 
present  (or  partially  present)  of  a  given  system  of  interest,  i.e., 
the  uncertainty  of  the  system,  an  uncertainty  model  must  be  es¬ 
tablished,  which  in  some  sense  captures  both  the  system  and  its 
uncertainties.  One  way  this  can  be  accomplished  is  through  a  com¬ 
plete  intuition/heuristic  approach,  where  a  single  individual  or 
panel  of  experts  considers  the  problem  at  hand  and  in  a  coordinated 
manner,  using  his/her  (or  their)  pooled  knowledge,  attempts  a  solu¬ 
tion.  This  is  based  on  the  general  principle  that  human  beings  can 
successfully  attack  very  complicated  problems  by  use  of  common  sense 
and  learned  knowledge.  Another  approach  is  to  establish  a  rigorous 
systematic  model  of  the  problem,  consisting  of  a  choice  of  measures 
or  (possibly  generalized)  distributions  or  dispersions,  together 
with  some  collection  of  logical  and  other  types  of  operators  acting 
upon  them  according  to  some  prescribed  calculus.  We  call  such  a 
model,  a  general  logical  system.  Another  approach  is  to  attempt  to 
integrate  the  first  two  approaches  by  some  kind  of  feedback-loop 
procedure,  so  that  the  model  of  the  problem  reflects  the  experts' 
knowledge  and,  at  the  same  time,  these  experts  may  be  used  to  over¬ 
ride  all  decision  making  outputs  or  to  adjust  or  correct  the  model 
in  an  adaptive  manner.  The  last  approach  serves  as  the  basis  for 
the  concept  of  knowledge-based  systems  ,  which  play  a  central  role  in 
the  AI  approach  to  complex  problem  solving. 

In  Chapters  8  and  3,  some  aspects  of  knowledge-based  systems 
will  be  considered  in  addressing,  especially,  problems  concerning 
the  combining  of  evidence  or  clues  describing  the  states  of  a  system 
that  are  present,  represented  in  terms  of  parameter  vectors. 

General  logical  systems  may  be  considered  from  the  following 
general  viewpoint: 

A  general  logical  system  is  essentially  a  semantic  evaluation 
of  a  formal  language  in  a  collection  of  (deduction)  categories  which 
in  some  sense  generalize  SET,  the  category  of  all  ordinary  sets. 
In  general,  these  categories  do  not  have  a  topos  structure  and  con¬ 
sequently,  may  not  be  sound  and  complete  (deductively).  However,  it 
is  hoped  that  at  least  "locally"  -  in  the  same  sense,  that  local 
theories  are  developed  for  disciplines  such  as  for  quantum  mechan¬ 
ics,  topology,  mathematical-biological  systems,  and  number  theory  - 
the  system  will  be  consistent.  See  related  comments  on  adaptive  or 
"local"  logics  by  Bellman  [13‘]  and  Bellman  and  Zudeh  [15].  The 
first  reference  also  contains  interesting  comments  on  uncertainty 
decision  making  and  artificial  intelligence  by  one  of  the  eminent 
scientists  of  our  time.  See  also  Haack's  texts  on  nonstandard  or 
deviant  logics  -  [100]  as  well  as  [101],  In  addition,  see  section 
10.2  concerning  the  various  controversies  involving  fuzzy  logic. 

The  following  discussion  is  based  in  part  upon  the  properties 
of  several  categories  and  topoi  which  in  a  natural  sense  generalize 
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or  fuzzify  SET:  (See  section  2.4  for  a  detailed  presentation  of 
these  entities . ) 

Candidates  for  the  range  of  the  semantic  evaluation  represent¬ 
ing  a  general  logical  system  include,  naturally,  Higg(H)  ,  f’uz(H)  , 
Fuz^(H)  ,  and  Gog(H)  .  First,  consider  Higg(K)  .  As  a  topos,  it 

has  "nice"  properties,  as  mentioned  above.  However,  it  is  often  too 
strict  a  structure,  since  a  =  min  ,  v  =  max  ,  and  the  rest  of  the 
intuitionistic  logic  structure  are  not  compatible  with  general  nega¬ 
tion,  t-ncim  or  t-conorm  structure  (see  2.3.6),  the  latter  being 
more  suitable  for  natural  language/real-world  use.  (See,  e.g.,  the 
comments  of  Thole  et  al . ,  [250],  e.g.)  In  addition,  Higg(H)  has  a 
more  restricted  and  complex  structure  than  Fuz(H)  ,  as  the  investi¬ 
gations  in  2.4  attest.  On  the  other  hand,  Fuz(H)  is  not  a  topos, 
as  seen  in  2.4,  and  is  also  restrictive  in  that  intuitionistic  logic 
-  except  for  equality  ( * )  -  is  required.  Although  some  Fuz_(H)'s 

are  topoi  for  appropriate  choice  of  *  (see  2.4)  ,  they  have  rela¬ 
tively  sparse  structures.  Hence,  one  is  led  to  consider  their  union 
FUZ(H)  .  But  FUZ(H)  has  still  some  typpe  of  intuitionistic  logic 
on  it;  in  particular,  Fuz(H)  still  depends  highly  on  a  and  v  in 
the  definition  of  its  arrows.  (See  also  Pitts'  recommendation  for 
Higg(H)  in  place  of  FUZ(H)  [203].) 

The  remaining  candidate  Gog(H)  ,  which  in  one  sense  is  close 
to  Fuz(H)  in  structure  -  no  intuitionistic  logic  structure  - 
indeed,  no  logical  structure  -  is  required  for  its  arrows.  In  par¬ 
ticular,  using  the  fact  that  t-conorms  or  copulas  are  bounded  below 
by  v  ,  we  can  consider  the  subclass  of  arrows  in  Gog(H)  exactly 
connecting  objects  in  Gog  .  This  subclass  of  arrows  will  yield  a 
subcategory  of  Gog(H)  ,  due  to  the  associativity  of  t-conorms.  How¬ 
ever,  even  this  may  not  capture  all  operations  of  importance,  such 
as  copulas  and  co-copulas  -  which  may  be  non-associative  -  as  used 
in  Sklar's  Theorem  for  joint  distribution  functions  (see  2.3.9). 

In  any  case,  we  will  first  define  a  general  logical  system  as  a 
semantic  evaluation 

II  II  :  t  -  (GOG,  Sub) 


where 


GOG  =  u  (Gog(H) , Logic  ) 
ae  I  1 

for  some  index  set  I  ,  and  where  each  (Gog(H),  Logic^)  repre¬ 
sents  Gog(H)  and  a  collection  of  logic,  quantifier,  and  possibly, 
foundations  operators  (see  2.2.2)  acting  upon  Gog(H)  ,  which  for 
simplicity  is  denoted  LogicQ  . 

Often  a  general  logical  system  contains  only  a  small  list  of 
basic  operators  such  as  "nt","&",  "or"  as  simply  (nt,  &,  or)  . 

Next,  for  any  choice  of  is  Var(0b(£))  , 

define 

II  i»  =  (iiiii(1),  n  id  (  2  5  )  i  (Xi,ei)  €  Ob  (  Gog  (  H  )  )  , 


X.  €  Ob  ( SET )  ,  9  .  :  X .  H  , 

1  ll 


where 
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and  we  define  generalized  or  fuzzy  set  A^  as 


with  membership  function 

$  =  n  ill  (  2  ]  =8  . 

A .  i 

l 

For  any  j  <=  Var(Ob(£))  such  that 

ujii*2)  =  J-H  -  H  :  Xj  "*  iH  ( constant )  , 
identify  n  jii  =  (X.,i„)  with  fa  . 

J  ** 

For  any  j  «  Var(Ob(t))  such  that 

njii^2'  =  Th  ,  Th  :  Xj  Th  (constant)  , 

identify  li  jil  =  (X.,TU)  with  X,  e  Ob(SET)  . 

J  H  J 

Next,  for  any  X  e  Ob(SET)  ,  consider: 

*(X)  =  {A..  |  i  e  Var  ( 0b(  l  )  )  and  H  ill  ( 1  *  =  X) 

and  assume  there  exists  i^  e  S(X)  such  that 
nijjii  =  X  . 

We  call  MX)  a  class  of  neneralized  or  fuzzy  subsets  A^  (=i) 
of  X  . 

MX)  =  { n  i  n  (  2  5  |  i  «  Var  ( Ob  ( £  }  and  iiiii(1)  -  X)  s  HX  . 

We  call  3"(X)  a  class  of  dispersions  or  generalized  (or  fuzzy) 
set  member*!,  ip  functions  (=  04)  over  X  . 

i 

Depending  on  n  n  ,  of  course,  often  we  may  have  MX)  =  , 

for  all  X  e  Ob ( SET )  . 

We  assume  that  n  n  ^ 2  ^  :  Var(Ob(t))  -»  Ar(SET)  is  injective. 

Related  to  this  map  is  the  membership  function  -  map  <}>  ,  where 

4>  :  Ob  (  SET  )  -  Ar  (  SET  )  , 

where  for  all  X  €  Ob(SET)  ,  $(X)  e  MX)  where  for  all  A.,  e  5(X)  , 

<t(X)(A.)  =<$>.=  lli«(2)  =  6  . 

1  Ai  1 

Note  that  <?>  ( X )  :  7(X)  MX)  is  also  injective,  for  each 

X  e  Ob ( SET )  . 

Suppose  for  convenience  now  H  =  [0,1]  . 
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^or  be  a  9iven  t-conorm  (see  2.3.6). 


Let  f  :  xi  -  Xj  ,  g 


Xj  Xk  be  ordinary  functions  with 


ordinary  composition  go f  :  x.  -  xk  .  Consider  now  any 
®i  :  xi  ■*  H  -  but  9  j  :  Xj  -•  H  and  ek  :  Xk  H  defined  by 


VY)  ~  ^or  (9i<x)>  =  <?cr(si  [f'1]  (x)  )  &  V  (S. 

kef_1(y)  xef''1(y) 

all  y  €  Xj  , 


<««(*))  : 


so  that  in  particular,  for  any  x  <=  X, 


8 i(f (X) )  *  « 

1  •  or 


(s i lx' ) )  >  e i ( x )  , 


x'ef-1(f (X) ) 

and  hence  f  €  Ar(x  ,  ,e  ,(G°g(H))  in  the  above  "exact"  sense. 

l  i  j  j ' 

^  similarity  of  the  approach  here  with  that  of  a  large  class 
of  functional  extensions  considered  in  2.3.7. 

Analagously ,  we  may  define  for  all  z  e  xk  , 

V2)  =  tf>or_  C®  j  (y* )  )  - 

Y'cg  a(z) 

yield!"::  similarly  „  ,  Ar ^ ^  <Gog,H)  i  . 

In  turn  because  of  the  associativity  of  4  ,  it  follows  that  for 

all  x  e  x.  , 


ek(g(  f  (x) ) )  =  cpc 


y'eg"  (g( f (x) ) ) 


(e  j ( y  ’ ) ) 


=  <5>or  (•!(*•)) 

x'«f“1(y  ) 

y« g_1  (g(  f  (X) ) ) 


( o  d  ( x  ’  )  ) 


x‘€(g°f)  ifgofjfx)) 

(g° f ) (x) )  . 


and  hence  qof  «  Ar  \  , 

r  ^  Xi  ’  9  i  i  '  ^  Xk  '  9  k  ^  G°9  H  '  aiso  ln  exact  sense 

as  above . 

Conversely,  given  any  (X^e.)  ,  end  any  (X  e  ,  e  ob(Gog(H))  , 
does  there  exist  some  f  :  X1  -  X^.  such  that 
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0. j(f(x))  =  <^or  (01(x'))  ,  for  all  x  e  Xi  ? 

x'«f"1(f (x) ) 


At  least  one  such  f  always  exists,  provided  e.  and  are 

normable,  i.e.,  there  are  points  x  e  X.  ,  y  €  X.  such  that 

^  i  1  o  j 

s i ( xo )  =  and  e^(xo)  =  T^  and  that  9 is  continuous. 

In  any  case,  we  have  a  well-defined  subcategory  (Gog(H),$Q  ) 
of  Gog(H)  with  possibly  some  empty  classes  of  arrows,  i.e.. 


'(X1,01),(XJ,0j)(G°9(H)^or)  " 


for  some  pairs  (X^.e^)  ,  (X^,e^.)  €  Ob(Gog(H))  =  Ob  ( Gog  ( H )  ,  4<or )  . 

Furthermore,  we  may  afix  other  logical  operators  such  as  a 
negation  $  :  H  -*  H  and  t-norm  H  x  H  -*  H  .  Of  course, 

these  operations  do  not  affect  the  category  structure  of  Gog(H) 
(and  hence  ( Gog  ( H  ),<}>  )  )  except  to  be  used  in  computing  semantic 

evaluations  for  various  elements  of  Wfex(r)  ,  i.e.,  well-formed 

expressions,  as  shown,  e.g,,  in  sections  2.2.2,  2.3.2,  and  2.3.6. 

(See  also  section  2.6  for  extensive  examples  of  the  evaluation  of 
natural  language  forms  via  formal  language.) 

Thus,  in  summary,  an  important  class  of  general  logical  systems 
may  be  indicated  as  (C,®,®;  7)  or  ^nt'^fi'^or’  7 )  where 

7  :  Ob  ( SET )  -•  Ob  ( SET )  or  more  specifically  7(X)  £  HX  ,  for  each 

set  X  . 

General  logical  systems  are  related  to  L-valued  logics,  which 
are  generalized  by  Czogala  [44''']  to  probabilistic  L-valued  logics, 
using  Hirota's  "probabilistic  sets"  [111). 

Again,  it  should  be  emphasized  that  dispersions  as  discussed 
here  generalize  the  concept  of  membership  functions  of  ordinary 
sets,  i.e.,  functions  of  the  form 


Vx) 


1  iff  x  €  A 
0  iff  x  «  A  , 


where  A  is  an  ordinary  subset  of  X  ,  x  e  X  ;  X  also  an  ordin¬ 
ary  set.  Even  before  Zadeh  [276'),  Klaua  [139]  and  others  (such  as 
Watanabe  [268])  extended  membership  functions  to,  in  effect,  gener¬ 
alized  or  fuzzy  subsets  of  an  ordinary  space.  See  also  2.3.6  for 
the  development  of  families  of  operators  upon  such  membership 
functions,  extending  ordinary  set  operations,  including  complement, 
intersection  and  union.  Indeed,  the  manipulation  of  classical  sets 
and  functions,  independent  of  any  generalized  set  considerations, 
has  often  been  facilitated  by  the  employment  of  (classical)  mem¬ 
bership  or  "indicator"  functions  of  sets,  such  as  will  be  found  in 
any  standard  textbook  on  Lebesgue  integration.  In  conjunction  with 
this  idea,  see  the  recent  paper  of  Morrill  [179'],  where  the  use  of 
membership  functions  for  teaching  set  theory  is  emphasized  -  but 
completely  ignoring  the  fact  that  the  same  or  similar  formal  manipu¬ 
lations  may  be  used  to  discuss  generalized  sets!  Furthermore,  after 
a  moment's  thought,  the  category  theory  concept  of  subobject  (or 
functor  Sub  -  see  Appendix  1,  end  of  section  2.4.2)  is  also  compa¬ 
tible  with  the  Idea  of  a  membership  function. 


Symbolization  and  Evaluation  of  Language 


89 


Two  obvious  examples  of  generalized  sets  in  our  everyday  lives 
are  given  below: 

In  the  clothing  business,  size  of  women's  clothing  is  in  gen¬ 
eral  determined  by  height  and  weight  through  a  few  simple  natural 
language  labels  such  as  "petite",  "average",  "tali",  "full  figure 
short",  full  figure  tall".  Despite  the  common  use  of  these  terms  to 
describe  ordinary  sets  in  height  vs  weight  space,  for  purposes  of 
convenience,  the  boundaries  of  these  "sets"  should  actually  be  fuzzy 
and  could  be  so  indicated  by  lightening  the  colors  near  the  boun¬ 
daries  ,  etc . 

The  above  remarks  also  apply  to  visual  examples  such  as  a  cloud 
in  the  sky.  Thus,  we  could  conceivably  divide  up  a  region  of  the 
sky  into  cells  and  determine  the  percentage  of  black  vs  white  (vs 
grey,  etc.)  pixels  or  primitive  picture  dots  within  a  given  cell, 
and  in  turn  convert  the  percentage  of  black  ones  directly  into  a 
membership  function.  Thus,  for  cells  well  within  the  "boundaries" 
of  the  cloud,  membership  levels  are  approximately  unity,  for  cells 
near  or  on  the  boundary  area,  membership  levels  drop  and  for  cells 
well  outside  the  boundary  area,  membership  levels  are  approximately 
zero  . 

These  ideas  appear  also  to  be  related  to  the  classical  notion 
cf  the  density  of  a  set  (see,  e.g.,  [181'],  pp .  261-263)  and,  as 
well,  to  extended-dimension  concepts  as  presented  in  Mandelbrot's 
exposition  on  fractals,  i.e.,  highly  irregularly-shaped  geometric 
objects,  typically  appearing  in  nature  [167'].  In  addition,  inde¬ 
pendent  of  fuzzy  set  theory  directly,  Mandelbrot  also  considered,  in 
effect,  generalized  set  membership  functions  over  "gaps"  of  frac¬ 
tals  arising  from  random  set  coverages  (167'],  d.  366),  See  also 
Chapters  3  -  5  of  our  text,  for  various  relations  between  random  set 
coverages  and  generalized  set  membership  functions. 

Three  general  examples  of  general  logical  systems  are  given 

n< t  . 

In  the  previous  paragraphs,  we  have  seen  how  the  membership 
symbol  and  its  natural  evaluation  determine  membership  mappings.  A 
membership  mapping  $  depends  on  H  •  n  .  However,  for  simplicity, 
when  no  confusion  is  possible,  we  will  drop  this  dependency  in  our 
writing,  but  not  from  our  mind!  The  membership  mappings  turn  out  to 
play  a  crucial  role  in  developing  the  concept  of  general  logical 
systems  as  we  now  proceed.  The  motivation  for  out  general  defini¬ 
tion  below  will  clarified  through  examples. 


Exam pi es. 

1 .  Boolean  system.  Let  t  be  the  formal  language  of  ordinary 
predicate  calculus.  For  each  set  X  ,  7(X)  is  the  class  of  sub¬ 
sets  of  X  ,  and  7(X)  is  the  class  of  all  ordinary  set  membership 
functions  over  X  . 

2.  Zadeh's  system.  Let  l  be  the  formal  language  with 

Lukasiewicz  logic  n  .  For  each  set  X  ,  5(X)  is  the  class  of 

X 

all  generalized  sets  on  X  ,  and  7(X)  =  [0,1]  ,  the  class  of  all 

dispersions  over  X  . 

3.  Probabilistic  system.  Let  £  be  the  formal  language  of 

Probability  Logic  (see  section  2.3.3).  Let  Y  e  Ob(SET)  and  choose 
X  5  9 ( Y )  ,  where  X  is  a  o-algebra  of  subsets  of  Y  .  Then  7(X) 

corresponds  bijectively  to  7 ( X )  ,  the  class  of  all  probability  mea¬ 

sures  over  Y  .  (See  also  section  2.3.9.) 
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b)  Set  operations  for  general  logical  systems. 

Note  again  (see  section  2.3.4  (b)  )  the  relationship  between 

Wff(t)  and  all  generalized  sets  through  the  Axiom  of  General 

Comprehension  (GC)  and/or  the  contextual  or  operator  approach  to 
class  abstraction. 

On  the  other  hand,  we  have  stated  (2.3.4)  that  (see  originally 
Maydole  [172])  if  set  axiom  (GC)  is  accepted  in  conjunction  with 
any  logic  of  interest  (see  again  section  2.3.4  for  the  list),  except 
for  ,  a  Curry-like  parado-',  and  hence  inconsistency,  occurs. 

More  generally,  Maydole  [172]  showed  that  (GC)  in  such  cases 
implies  the  validity  of  several  inference  rules  for  all  logics 

of  interest,  including  quasi-universal  and  quasi-existential 
instantiation,  as  well  as  other  quasi-forms  (quasi-modus  ponens , 
simplification,  absorption,  etc.).  (He  also  showed  that  whenever 
the  above  inference  rules  hold,  a  Curry-like  paradox  indeed 
results . ) 

Now,  many  set  theory  axioms  (see  2.3.4  for  various  listings) 
will  be  of  the  form 

Q(4>  «=»  *)  . 

where  Q  consists  of  a  string  of  universal  and/or  existential 

quantifiers  and  $  and  f  are  (in  general,  open)  wff's  for 

formal  language  l  .  For  example,  complementation  (212)  ,  car¬ 
tesian  products  ( Z 1 3 )  ,  unions  (Z4)  ,  etc.,  all  fall  under  this 

classification . 

It  follows  readily  that  for  a  given  general  logical  system  with 
the  usual  logical  constants  nt  ,  &  ,  or  ,  »  ,  «=»  e  Loc(£)  ,  if  the 
quasi-form  inference  rules  (section  2-3.4)  such  as  (QUI),  ( QEI ) , 

( QME ) ,  (QS),  etc.,  hold,  than  any  such  set  theory  axiom  as  mentioned 
above  will  imply  at  least  in  an  asymptotic  sense  the  existence  of 
corresponding  class  abstract  "sets".  The  term  "asymptotic"  may  be 
omitted,  if  the  stronger  non-quasi  inference  rules,  (UI),  (El), 

( ME ) ,  etc . ,  hold . 

As  an  illustration,  consider  the  following  development  for 
(Z12)  ,  where  i  e  Var(0b(£))  is  arbitrary,  e^(x,A)  is  replaced  by 

simply  (x  e  A)  ,  for  any  x,y,z,  .  .  .  e  wfv^(i)  and  general  sets 

A , B , C , ...  €  Wfv  . (1)  : 

a1 

From,  e.g.,  Maydole  [172],  p.  77, 

(  Z 1 2  )  =  Q($  <=»  ¥■)  , 


where 

Q  -  (Vir  )  (3a) (Vp) 

{P  )  =  (pea  ) 

V  ^  TP  (p  ,  i  )  =  nt  (pei  )  . 
Then,  applying  (QUI) 
n  (  Z12  )  II  =  1 


Symbolization  and  Evaluation  of  Language 


91 


implies  that  for  all  t 

II  (3a)  (vp)  (4>(p.a)  «=>  TP  (/ 5,t))H  =  1  . 

In  turn,  applying  ( QEI ) ,  it  follows  that  for  all  t  and  each  inte¬ 
ger  mil,  there  is  an  a  such  that 

m 

HV/5(<J>(/3,am)  «=>  <e(p,  t )  )  II  >  1  -  1/m  . 

Next,  again  using  (QUI)  we  obtain,  for  all  t  ,  each  integer 
m  i  1  ,  a  corresponding  a  ,  such  that  for  all  v  , 

il*(v,a  )  «=»  ?  ( v ,  t )  II  i  1  -  1/m  . 
m 


Finally,  if  quasi-material  equivalence  and  quasi-simplification  hold 
for  L  (see  Maydole  [172],  p.  124),  then  if  ii  li  is  truth  function 
with  respect  to  =»  and  il=»u  is  continuous,  then  for  all  t  ,  each 
integer  m  i  1  ,  a  corresponding  am  exists  such  that  for  all  v 

|n$(v,a  )n  «=»  n  y  ( v ,  t )  ii  1  <  1/m  . 

We  could  call  each  such  a  as  above  the  m**1  approximating  com- 

m 

plement  c  ( ">  )  of  i  . 

Hence  in  an  asymptotic  sense,  for  any  given  t  e  wfv  ,(/.)  we 

n 

can  postulate  Ci  «  Wfv  ,(£)  such  that 

n1 

up  «  cm  »  imt(p  «  i)»  . 

Of  course,  if  the  stronger  inference  rules  ( Ul ) ,  (El),  (ME), 

etc.,  hold  -  as  is  true  for  L_  or  G  then  the  above  proof  may 

mm 

be  appropriately  modified  so  that  the  asymptotic  interpretation  may 
be  omitted. 

Furthermore,  for  (as  was  stated  in  section  2.3.4), 

*1 

Maydole's  modification  of  (GC)  together  with  Hayes'  logical  axioms 
will  imply  the  validity  of  the  quasi-form  inference  rules  and  yet 
consistency  will  also  hold  (Maydole  [172]),  although  the  logic  is 
deductively  and  logically  incomplete.  Hence,  at  least  in  an  asymp¬ 
totic  sense,  complementation,  cartesian  products,  intersections, 
unions,  etc.,  all  may  be  safely  defined  -  with  no  paradox  derivable. 

In  any  case,  motivated  by  the  above,  we  can  define,  for 
example,  the  following  basic  general  set  operations,  for  truth 
functional  systems;  through  a  class  abstract-like  approach.  (See 
also  the  previous  discussion  in  2.3.4  (d).) 


(  i )  CoMpleaentation 

C  :  Wfv  i {l )  -  Wfv  i {l )  . 

n  n 
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For  any  A  e  Wfv  .  (t)  ,  there  is  a 

n1 

CA  e  wfv  .  (t)  such  that,  for  all  x  e  Wfv.(Z)  , 

n1  1 


whence 


nt(x  e  A)  =  (x  e  CA)  , 


it  nt  (x  e  A)  H  =  until  ( H  x  e  All  )  =  il  ntll  (  u  All  (  n  xii  )  ) 
=  until  (<t>A(x)  )  =  $nt($A(x)) 

=  II  x  e  C  All 


=  *CA(X) 


( i i )  Cartesian  product 


x  :  Wfv  .{t)  x  Wfv  At)  -  Wfv  .  ,U)  , 

n  £2-)  n  3 

For  any  A  e  wfv  ,{t)  ,  Be  wfv  .  (z)  and  any 
fi1  n 3 

x  e  Vlfv^(t)  ,  and  y  e  Wfv^{£)  , 


whence 


(x  e  A) & { y  e  B)  =  { (x,y)  e  A  x  B))  , 


II  (x  e  A)  &  (y  e  B )  II  =  II  fill  (II  All  (II  xll  )  ,  MBll  (ilyli  )  ) 

=  ^&^A{X)  '  (f,B(y)  } 

=  H  (x,y)  e  A  x  bii 

m  *AxB(X'y)  • 

( i i i )  Intersection : 

n  :  Wfv  At)  x  Wfv  At)  -  Wfv  .(t)  . 

na  n1  n1 

Specialize  cartesian  product  in  (ii)  for 
i  =  j  ,  x  =  y  : 


whence 


(X  e  A )  &  ( x  e  B)  =  (  (X,X)  e  A  x  B)  =  (X  e  ATIB )  , 


ii  ( x  €  A )  &  ( x  e  B  )  II  =  $&(<}>A(x)  ,  $B(x)) 

=  ^AxB(X'X) 

=  II  x  €  AOBH 


^AnB(x^ 


( iv)  Cartesian  sun 


t  :  Wfv  .(£)  x  Wfv  At)  -  Wfv  ,  At)  . 

n  n-3  n1*-3 

For  any  A  e  Wfv  At)  ,  Be  Wfv  .(t)  and  any 

£2  n3 


x  e  Wfv^(t)  ,  y  e  WfVj(t)  , 


(x  e  A)  or  (y  e  B)  =  ( ( x , y )  e  A  t  B) 
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whence,  analogous  to  (ii), 

ll  (x  €  A)  or  (y  e  B )  II  =  <t>or (  4>A<  x )  -  <$>B ( Y )  ) 

=  <PAlB(x>Y)  ■ 

( v)  Union 

U  :  Wfv  .(it)  x  Wfv  .(*)-.  Wfv  .{£)  . 

fi  n  n1 

Analogous  to  (iii),  specialize  cartesian  sum  in  head¬ 
ing  (iv)  for  i  =  j  ,  x  =  y  : 

(x  e  A)  or  (x  s  B)  =  ( ( x , x)  «  A  t  B)  =  (x  e  AUB)  , 
whence 

II  ( X  €  A)  or  (x  €  B ) ll  =  4>or  (#A(x),  #B (x)  ) 

=  4>AtB(x,x) 

=  ll  x  €  AUB  ll 
=  ^AUB(X)  • 

Symbolically,  we  may  write  more  generally, 

(Xj . xr)  e  comb  ( nt ,  &,  or )  ( A^. ,  .  .  .  ,  A  ) 

=  comb ( nt ,  &,  or)(<xJ  e  A1>,...,<xn  e  Ar> )  , 

where  some  of  the  A^'s  and/or  xj's  <  Independently,  may  be  the 

same,  and  "comb"  refers  to  some  combination  of  operators.  In  the 
above,  we  could  also  use  nt ,  x,  t,  »  ,  etc. 

Of  course,  all  of  the  above  operations  in  (ii)  -  (v)  may  be 
extended  in  the  obvious  way  to  an  arbitrary  finite  number  of 
arguments,  and  further  by  standard  limiting  procedures,  to  an 
arbitrary  number  of  arguments.  Note  the  use  of  $  for  until  ,  <}>& 

for  ii&ii  ,  $  for  Horn  ,  reminiscent  of  negation,  t-norm  and 

t-conorm  operators  discussed  in  section  2.3.6.  In  fact,  often  the 

general  logical  system  will  be  such  that  d>  .  ,  tb.  ,  *  are  such 

nt  &  or 

operators . 

For  non-truth-functional  logics  such  as  Probability  Logic,  the 
above  relations  must  be  modified. 

( vi) Subset  relations  may  be  defined  in  two  different  ways: 

For  any  i  e  Typ(t)  ,  A,B  *  Var  .{£)  , 

n1 

(a)  A  £  B  iff  ,  by  definition, 

4>a  s  i$>b  pointwise  over  X^ 

or 

(b)  ii  A  £  BH  =  ii  &  ((x  «  A)  »  (x  €  B )  )  it  ,  etc. 

xe  X^ 
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Subset  relations  for  generalized  sets  may  be  defined  more 
systematically  as  follows: 

Let  B  e  Var(Ob(£)  .  Then  for  any  A  «  Var(Ob(£))  , 

AS  B  iff  ii  All  S  ii  Bi!  iff  II  All  «  Sub  (li  3n)  . 

Now,  the  generalized  power  class  of  nBn  =  (Xg,8g)  is 

Sub { xB , s B )  =  {¥■  1  r  :  XB  -  H  ,  r  <  sB) 


and  hence 


II  All  S  II  Bit 


iff 


H An  =  y  or  <XB,P)  ,  such  that  M  »B  , 


Also 


nnBii  =  ii  mi 11 B"  =  (Sub(Xg,8g)  ,T) 

(See  2.4.2  (D)  for  basic  properties  of  Gog(H).) 

IV/'ti,)  Membership  relations  for  generalized  sets  are  obtained  from: 
Let  x  e  wfv.(i),  A  e  Wfv  .(£)  ,  i  e  Var(ob (£))  ,  Then 

nJ 

II xe  An  =  II xii  lie  A II  n  AH  . 

Consider  first  iie^ii  : 

ii  e  ^ ii  e  Sub  ( II  ill  x  im1!!) 

=  Sub((Xi,ei)  x  ( {y  |  y  :  xd  -  H  and  y  <  0..},T) 

-  Sub(  {  (x,y )  |  x  €  Xi(  V  :  -  H  and  y  <  8  },  e  )  , 

where  for  all  yeX^,  y  :  Xi  ->  H  ,  r  <  9  ^  , 

He  (x,y )  -  x  ue  ii  y 

^  y (x)  s  e  ^  (x)  • 


Hence , 

II x  e  AH  =  H All  (li xii  ) 

=  4>a("xh  )  . 


A  linguistic  example  is  "John  is  happy", 
li  xii  =  John  , 

ii  AH  =  happy  :  X^  -«  H  e  iifl^li  , 
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2.4. 
(d)  : 


ii  in 

nnin 


=  X,  = 


=  nan 


set  of  humans 

11  i'1  _ 

—  n  , 


lieu  :  X^  x  H  1  -♦  H  (with  restriction)  , 

"John  is  happy"  =  John  e  happy  . 

See  also  sections  2.3.2,  2.6  and  Appendix  3  (iv)  of 

Let  us  consider  in  some  detail  the  semantic  evaluation 


section 
in  2.3.4 


q  A  M*(1> 


f  [x 


u)  „(j: 


]}ll  :  lljll 


ii  n  a  ii 


e  Ar 


II  j  II 


,,.nVGo9(H) 


Thus , 


Xi 

q  :  Xj  -*  Sub  ( it  i  II  )  =  {f  |  f  e  H  with  f  <  pointwise) 

with  the  restriction 

<  T ( q ( • ) )  ,  pointwise, 

i.e.,  no  restriction.  Before  defining  ,  consider  the  semantic 

evaluation  of  v  and  y[.,y^^]  ;  Since  the  evaluation  is 

essentially  in  (Gog(H),Sub)  ,  it  follows  that 

ii  v n  e  Sub(  ii o  (y  )  ii  ) 

=  Sub (  H  ill  x  11  ju  ) 

X.xX 

=  (f  |  f  €  H  ,  f  s  9 ^ ( •  ) as ^ ( • • )  ,  pointwise)  . 

Next  consider,  for  any  l  e  Var(Ob(£))  and  any  (g  :  Hill  -♦  Hill 

x  lljll:  =  Hk ii)  €  Ar(Gog(H))  .  It  then  follows  from  the  definition  of 
pullbacks  (see  2.4.2,  Appendix  1)  that 

Sub  ( g )  :  SubH  <  Ii  -•  Subn  kli 

where  for  any  h  €  Sub ( n  < n  )  ,  letting  mono  (h  :  n mi  -»  imi  )  € 

Ar(Gog(H)),  for  some  n  e  Var(Ob(i))  , 

(ii nil .x  iikii) 

Sub ( g )  ( h )  =  (proj||k||  ,g  :  linn  x  nkii  -♦  iikM) 

h,g 

where 


ii  nil  x  nkii  =  (Xn  x  Xk,  h(  •  )  a  g(--))  , 
h ,  g 

so  that  for  any  (s,t)  e  x^  x  Xk  , 

Sub(g) (h) (s,t)  ^  h(s)  a  g(t)  . 
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In  particular,  note  that 


II  1  ^1! 

m 

:  ii  mil  ->  Dili 

Jm 

:  II  mu  -*  11  j  II 

where  U  mil  =  (X  ,9  ) 

in  m 

Let  g  S  ii  <x ^  ' 
ra 

lly[xll)  ]» 

m  m 


iijn  =  (Xj,Sj)  ,  etc. 

y^-^>n  ,  with  t  =  i  x  1  and  h  =  'f  . 

iivii  [ii  <x  ^ 1  ^  ,  y‘3)»] 

m  'll 

(Sub  n<x^  ,y^>n  )  (n*n  ) 

m  m 

ii  v  ii  {•,••)  a  <u  x^  i  ^  ii  ,  li  y  ^  ^  ii  >  ( •  ■  •  ) 

in  m 

II VII  (  •  ,  •  •  )  A<tlX^  II  (  •  •  •  )  ,11  y^  II  (  •  ■  •  )  >  . 

m  m 


Then 


In  any  case,  we  simply  define  q  as,  for  any  t  e  x^  ,  s  e  x^  , 
q  ( t )  (  s  )  =  inf’ll  (s  ,  t )  . 

If  we  use  the  generic  term  "comb"  for  an  arbitrary  combination, 
of  operators  C  ,  x  ,  t  ,  etc.,  then  for  many  systems,  further  com¬ 
pound  set  operators  may  be  obtained  through  the  relations 


comb ( nt , & , or ) (  •  •  ( x  €  A),  (y  €  B ) , • •  ) 

=  (••x,y,*‘)  e  comb( C ,n ,u ) ( • • A,B, • • )  . 

For  example, 

nt ( ( x  «  A)  or  (y  €  B) )  &  (z  e  C) 

=  nt ( ( x , y )  €  A  t  B)  &  (z  e  C) 

=  ( (x,y)  «  C (A  t  B) )  &  (z  €  C) 

=  ( (x,y,z)  e  (c (A  t  8)xc) )  . 

Algebraic  properties  of  C  ,  x  ,  t  ,  u  ,  n  ,  etc.,  all  are 
derivable  from  the  properties  of  nt  ,  &  ,  or  ,  =»  that  are  pos¬ 
tulated  for  i  .  For  example,  let  us  consider  associativity:  From 
the  above  definitions,  it  follows  that  for  all  x  ,  y  ,  z  ,  A  ,  B  , 
C  , 


(x  €  A)  or  (y  €  B  or  z  e  C)  =  (x,y,z)  e  (At  (B  t  C)) 

((x  e  A)  or  (y  «  B))  or  (z  e  C)  =  (x,y,z)  e  (A  t  B)  t  C)  . 

Thus,  it  "or"  is  an  associative  logical  operator  for  t  ,  then 

clearly,  the  two  left  hand  sides  of  the  above  equations  are  equal 

and  hence  so  are  the  two  right  hand  sides.  Thus,  associativity 

holds  for  t 

At  (B  t  C)  =  (A  t  B)  f  C  . 


Similar  results  hold  for  symmetry,  idempotent,  etc.,  for  t  as 
well  as  analogous  properties  for  x  ,  replacing  "or"  everywhere  by 


Symbolization  and  Evaluation  of  Language 


97 


.  Also,  similar  comments  are  valid  for  properties  involving 
both  t  and  x  such  as  various  types  of  distributivity  and  modu¬ 
larity,  Conversely,  properties  postulated  for  -  ,  t  ,  C  ,  become 
properties  for  &  ,  or  ,  nt  ,  etc.,  in  L  .  Also,  similar  comments 
hold  following  semantic  evaluations  for  the  corresponding  logics, 
truth  functional  cr  non-truth-functional.  Again,  see  2.3.4  for 
various  types  of  operators  with  prescribed  algebraic  properties. 
For  example,  t-norms  in  addition  to  being  required  to  have  certain 
boundary  conditions  also  must  be  symmetric  and  associative  -  and  a 
subclass  -  the  non  Archimedean  ones  -  will  also  be  idempotent . 
Indeed,  a  re-examination  of  the  various  logics  of  interest,  illus¬ 
trates  the  various  algebraic  properties  of  the  semantic  evaluations 
of  the  basic  logical  connectors  and  hence  (by  the  bijective  mapping 
discussed  earlier)  of  the  corresponding  general  set  theory  opera¬ 
tors  . 

Compound  predicates  may  be  formed  in  a  ,  the  formal  language 
class  (the  counterpart  of  the  natural  language  class)  -  and  then 
evaluated  for  truth  content  by  putting  them  into  fuzzy  set  form. 

By  this  procedure,  other  more  complicated  predicates  and  their 
quantifications  may  be  evaluated,  including  universal  and  existenti¬ 
al  quantifiers,  cartesian  sums,  and  products,  logical  implications, 
and  logical  equivalences.  Also,  subset  relations  and  relational  and 
functional  transforms  may  be  evaluated. 

Ac  an  example,  consider  the  semantic  evaluation  here,  in 
(Gog(H) ,Sub) ,  for  quantifications  and  substitution: 

Let  L  be  the  basic  formal  language  and  r  e  Wff(z.)  with 

a  (ip  )  =  1  x  j  ,  i,j  e  Var(Ob(£))  ,  x^.y^  e  Wfv(£)  and  write 

m  m 

*<x(1>.y<J>)  for  y[<x^i)  ,y^'>]  ,  etc. 

Then,  as  usual,  letting  m  =  i  x  3  x  lc  , 

» j»  =  (Xj,,e.)  ,  ujii  =  ' 

iiixjii  =  Hill  x  iijii  =  (Xd  x  Xj,  «1(-)  a  e  ^  ( *  •  )  >  ,  etc., 
imi  e  Sub (o  ( uyii  )  )  =  Sub(nixjn)  , 

i  .  e .  , 

IIPII  :  X,  x  x.  -*  H  with  iiymi  s  0.(-)  a  s  (••)  =  0  (•,•■)  , 

•i  J  J-  J  J 

and,  more  generally,  replacing  <x^^,y^-'^>  by 

m  m 

(f  :  m  i  x  j)  e  Var(Ar(£))  ,  we  have  for  the  substitution  v[f] 
(noting  formally  o(?[f])  =  m) 

»y[f  3 11  =  Sub { 11  fn )  ( it vii  )  . 

Now,  11  f  11  :  11  mil  -»  iiixjii  in  Gog(H)  and  it  follows  from  2.4.2 

(D)  that 

Sub(u  fii )  (iiyin  =  a  (inrii  °  nfn)  . 


11  (vx'1}  )  ir  (xi,a  1  ,  yi,-  1  )  11 

m  m  m 

=  hv  ( i ,  j  ,r )  11 

=  11  vii  (11  in  ,  11  j  11  , 11  vii  ) 

=  a  (0  .  (t)  =»„  inf’ll  (t,  •  )  )  , 

t€X1  1  H 


Also  , 
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ii  {akl]  )(*{ki)  .  £j)  ))» 

n\  m  m 

=  If  3  ( i ,  j  ,  V  )  II 

=  V  (inf’ll  (t,  •  )  )  . 


See  Kwakernaak  [148],  Goodman  [88],  Hisdal  [114]  and  Dubois  and 
Prade  [51],  p.  18,  for  techniques  for  evaluating  "fuzzified"  con¬ 
cepts.  Bellman  and  Zadeh  [15]  (see  also  [14])  were  among  the  first 
to  use  a  restricted  form  of  the  Principle  of  Abstraction  to  fuzzify 
concepts,  relying  on  Lukasiewicz  { ) -logic  (i.e.,  min  for  &  , 

1 

max  for  or  ,  1  -  (•)  for  negation,  etc.)  for  evaluation.  Giles 
[74]  also  has  emphasized  the  connection  with  Lukasiewicz  logic  in 
developing  fuzzy  logic  as  part  of  his  risk-based  subjective  approach 
to  modeling  of  fuzzy  set  concepts.  Skala  [237]  developed  a  number 
of  interesting  properties  of  Lukasiewicz  logic  for  use  in  fuzzy  set 
theory . 

Albert  [2]  has  proposed  an  alternative  (but  similar  in  many 
respects)  development  of  fuzzy  set  theory  and  fuzzy  logic,  within  an 
axiomatic  viewpoint,  including  qualifiers  such  as  "usually"  and  a 
new  approach  to  deduction. 

Bellman  and  Giertz  [14]  in  addition  to  establishing  the  previ¬ 
ously  mentioned  mutual  distributivity  characterization  of  also 

suggested  in  effect  use  of  a  form  of  the  Principle  of  Abstraction. 
In  practice,  the  required  mutual  distribution  conditions  may  be  too 
restrictive.  For  example,  Thble,  et  al .  [250]  and  Zimmermann  [291] 
have  shown  through  empirical  considerations  that  other  forms  of  <}>& 

and  <f>or  may  be  more  appropriate.  Interestingly  enough,  a  basic 

survey  (see,  e.g.,  Rescher  [214])  of  multi-valued  logic  shows  a 
paucity  of  choices  for  "and",  "or",  and  "not"  -  most  picking  some 
variation  of  Zadeh 1 s  original  choice  (independent  usually  of  Zadeh 's 
work) .  Other  restrictions  may  be  placed  upon  fuzzy  set  systems  such 
as  the  DeMorgan  relation  or  requiring  the  satisfying  of  certain 
functional  relations  (again,  see,  [140]). 

Thus,  a  calculus  of  fuzzy  set  operations  may  be  established. 
(See,  e.g.,  Dubois  and  Prade  [51]  for  extensive  listings  of  opera¬ 
tors,  relations  and  applications.) 

The  basic  problem  of  choosing,  for  a  given  situation,  which 
fuzzy  set  system  is  m°st  appropriate  may  be 

approached  by  both  empirical  considerations  and  theoretical  guide¬ 
lines  (by  appealing  to  random  set  connections  -  see  the  last 
comments  of  this  introduction).  (See  [250],  [291];  also  Goodman  and 
Nguyen  [92].)  Also,  modeling  of  possibility  functions  and  their 
modifications  poses  a  problem  -  but  essentially  no  different  than 
that  of  choosing  a  suitably  large  parameterized  family  of  probabil¬ 
ity  density  functions.  (See  Lakoff  [150],  Hersh  and  Carramazza 
[109],  MacVicar-Whelan  [167],  Kuzmin  [147],  Kochen  and  Badre  [144], 
and  Dubois  and  Prade  [51],  pp.  255-264,  for  various  modeling  of 
possibility  functions  and  their  operators.) 

In  addition,  the  work  of  Nanlun  [181]  should  be  mentioned  in 
regard  to  empirical  verification  of  fuzzy  set  membership  functions 
through  statistical  data  (random  "appearences" )  in  histograms  repre¬ 
senting  overlapping  classes  -  hence  not  necessarily  adding  up  to 
unity . 

Finally,  the  comprehensive  approach  of  Norwich  and  Turksen 
[194'']  in  modeling  membership  functions  should  be  mentioned  where  a 
number  of  representation  and  uniqueness  theorems  are  obtained. 
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2.3.6  Families  of  truth  functions. 

At  this  point,  let  us  summarize  briefly  some  of  the  high¬ 
lights  of  fuzzy  set  theory  as  originally  conceived  by  Zadeh  [2761] 
and  some  natural  extensions  of  it.  (Leech  ((153],  pp.  38-41  and 
Chapter  6  presents  "fuzziness"  from  a  linguistic  viewpoint.) 

The  basic  building  block  in  the  development  of  fuzzy  set  theory 
is  the  membership  function 

4>:  X  -  [0,1] 


which  uniquely  corresponds  to  a  fuzzy  subset  A  of  ordinary  set  X  . 
If  the  range  of  $A  is  a  subset  of  (0,1)  ,  then  reduces  to 

the  classical  ordinary  characteristic  or  set  membership  function 
found  in  most  textbooks  on  real  analysis  and  Lebesgue  integration. 
In  a  related  vein,  G.S.  Goodman  [93]  has  pointed  out  that  the  col¬ 
lection  of  all  fuzzy  subsets  of  a  closed  interval  may  be  considered 

* 

in  a  sense  the  weak  -  completion  (i.e.,  pointwise  convergence)  of 
the  collection  of  all  ordinary  set  membership  functions.  Operations 
on  fuzzy  set  membership  functions  -  or  ,  in  short,  possiblity  func¬ 
tions  -  therefore  extend  those  on  ordinary  sets.  Zadeh  originally 
considered  (pointwise)  1  -  (•)  ,  min,  and  max  as  the  basic  fuzzy 
set  operations  for,  complement,  intersection,  and  union,  respec¬ 
tively.  However,  after  some  thought,  it  is  clear  that  many  other 
extensions  for  these  operations  can  be  considered  as  alternative 
definitions  for  fuzzy  set  complements,  intersections,  and  unions. 
For  example,  complement  could  be  represented  by 


But  this  tnot  fails  to  satisfy  the  involutive  property  and  is  not 
Co,.  I inuous . 

All  of  the  above  definitions  for  <j>  t  satisfy  the  boundary 
conditions  in  eq.  (*)  and  are  non-increasing  functions.  We  will 


100 


Goodman  and  Nguyen 


call  sucn  a  unary  operator  ynot  :  C 0 , 1 ]  -»  [0,1]  a  negation  opera- 
t  or  . 

Intersection  could  be  represented  by 
<p&(x,y)  =  prod ( x ,  y )  =  x- y  , 

or  by 


<f>&(x,y)  =  maxsum(x,y)  =  max(x  +  y  -  1,  0)  , 

or  as  proposed  by  Zadeh  [276'] 

$&(x,y)  =  min(x,y)  , 

etc.,  for  all  x,y  e  [0,1]  .  All  of  the  above  definitions  are 

bounded  above  by  min  ,  are  nondecreasing  in  each  argument,  are  sym¬ 
metric,  continuous  and  associative,  as  well  as  satisfying  the 
0-1  classical  logic  boundary  conditions 

4>&(x,l)  =  <?&(l,x)  =  x  ,  4>&{0,x)  =  $&<x,0)  =  *&<0,0>  =  0  , 

for  all  x  €  [0,1]  .  This  implies  that  may  be  extended  to  any 

o 

number  of  arguments.  Such  a  function  <|>&  :  [0,1]  -♦  [0,1]  is  called 

a  t-norn.  In  a  related  manner,  union  could  be  represented  by  a 

2 

function  $or  :  [0,1]  -•  [0,1]  which  is,  similarly,  nondecreasing  in 

arguments,  symmetric,  continuous  and  associative  so  that  is  is 
extendable  unambiguously  recursively  to  arbitrary  multiple  argu¬ 
ments.  In  addition,  <}>  is  bounded  below  by  max  and  satisfies 

or 

the  boundary  conditions 

*or(1-x)  =  $or (x>  1  ]  "  =  *or{l’0)  =  *or(1,1)  =  1  ' 

<for(0,x)  =  4>or(x-°)  =  x  ;  all  x  €  [0,1] 


Possible  definitions  for  i>  include: 

Tor 

lt>or(x,y)  =  probsum(x,y)  =  x  +  y  -  xy  =  1  -  ( 1  -  x)  ( 1  -  y )  , 


or 


<}>or(x,y)  =  bndsum(x,y)  =  minsum  (x,y)  ^  min(x  +  y,  1) 

or  Zadeh's  originally  proprosed 

$or(x'y)  =  max(x,y)  ;  all  x,y  €  [0,1]  , 

among  infinitely  many  possible  extensions.  Such  a  function  A  is 

or 

called  a  t-conorm.  (For  background,  see  Klement  [140],  Goodman 
[91],  and  the  comprehensive  text  of  Schweizer  and  Sklar  [226].  See 
also  Frank  [61]  and  Ling  [159]  and  Alsina,  et  al.,  [4].) 
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Originally,  t-norras  and  t-conorms  arose  from  the  concept  of  a 
probabilistic  metric  space,  developed  independently  of  fuzzy  set 
theory  [226].  The  recent  paper  of  Czogala  and  Drewniak  [44']  ex¬ 
tends  the  definitions  of  t-norms  and  t-conorms,  including  changes  in 
min-bounds,  max-bounds  and  other  properties  and  applies  these  to 
random  variables  and  fuzzy  probabilistic  sets.  (See  also  the 
empirically-oriented  paper  of  Czogala  and  Zimmermann  [44"].)  In 
addition,  intermediate  operators  or  generalized  means  are  considered 
in  some  detail  by  Dyckhoff  and  Pedrycz  [52']. 

Let  ,  <p£  ,  <J>  be  any  negation  operator,  t-norm  and 

t-conorm,  respectively.  Then  since  we  may  extend  unambiguously, 
and  $or  recursively  (using  symmetry  and  associativity)  by,  e.g. 

Vxi*J W  =  W’VV  'x3> 

=  VxrVx2'x3>>  ' 

etc .  , 

then  <$>  r  ,  *  :  [0,l]n  -»  [0,1]  are  well-defined  for  all  n  >  1  . 

u  or 

For  n  =  1  ,  define  <&.(x)  =  $  (x)  =  x  . 

Call  the  triple  of  operators 

7  =  ( <t  .  ,  <t>  _  ,  4  ) 

Tnot  Tor' 

a  general  ( ruzz y)  set  system,  operating  on  some  given  class  of  fuzzy 
sets.  Such  a  system  extends  the  classical  Aristotelian  two-valued 
(0,1)  logical  system. 

A  general  fuzzy  set  system  is  said  to  be  DeHorgan  iff  for  any 

given  <t>  .  , 

3  Tnt 

*nt(<1,&(x'y})  =  ^cr^nt (x)  >  ^nt(V)) 

^nt(Vx'y)  =  VWX)'  *nt(y))  ' 

for  all  x,y  «  [0,1]  ,  etc.  For  simplicity,  for  all  of  the 

following  definitions,  assume 

4>nt  =  1  -  (•).  i.e.,  fnt(x)  s  i  -  x  . 

Denote  7)  as  the  class  of  all  DeMorgan  systems.  Thus, 

?0  =  (1  -  ( • ) ,  min,  max) 

9  -  (1  -  (•),  prod,  probsum) 

and 

7^  =  (1  -  (•  )  ,  maxstim ,  bndsum) 
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are  all  in  ,  but,  e.g., 

7'  =  (1  -  (•),  prod,  bndsum)  is  not  in  $  . 

Note  that  if  is  any  t-norm, 

<J>  (x,y)  -  1  -  (  1  -x,  1  -  Y)  ,  all  x,y  e  f  0 , 1  ]  ,  always 

determines  a  well  defined  t-conorm  and  hence  a  DeMorgan  system. 

A  copula  is  a  cumulative  probability  distribution  function 
2 

(c.d.f.)  over  R  such  that  its  two  one-dimensional  marginal 
c.d.f. 's,  each  over  R  ,  are  identical  in  form  to  the  c.d.f.  of  the 
uniform  random  variable  U  over  [0,1]  ,  i.e.,  the  c.d.f.  FQ  of 

U  is  at  any  x  €  [0,1] 

F  Ivl  =  x  . 

0  '  ' 

It  can  be  shown  that  not  only  any  associative  copula  is  also  a 
t-norm,  but  more  generally,  defining  for  any  fixed  integer  n  >  1 

an  rt-copula  as  a  c.d.f,  over  Rn  ,  all  of  whose  one-dimensional 
marginal  c.d.f. 's  are  the  same  as  FQ  above  (Sklar  [238]): 

(a)  All  associative  n-copulas  are  t-norms. 

All  t-norms  which  are  probability  distribution  func¬ 
tions  in  n-arguments  are  n-copulas.  For  any  n-copula,  <f>& 

and 

x  =  (x1 . xn)  ,  y 

I  4>&(x)  -  $&{ y) | 

3-1  '  J 


(yr 

•••'V 

e  [0,1]" 

n 

z 

j=l 

1  4>&(xj 

)  -  VYj) 

n 

z 

lx.  -  v 

,  1 

and  hence  is  uniformly  continuous  over  [0,1  ]n  . 

(b)  For  any  arbitrary  c.d.f.  F  over  Rn  ,  there  exists 
a  unique  corresponding  n-copula  4>&  such  that 

F(xl . V  =  VFl(xl> . W1  : 

al  1  x .  x  €  R  ,  (**) 

In 

where  Fi-''--,Fn  are  the  n  one-dimensional  marginal 
c.d.f.  's  over  R  for  F  . 

Indeed,  =  F ( F* (•),..., F^ ( • ) )  .  (See  subsection  on 

pseudoinverses  of  distribution  functions.) 

(c)  For  any  collection  of  n  one-dimensional  c.d.f.  's 

F ,,...,  F  over  R  and  any  choice  of  n-copula  ,  de- 

1  n  & 
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fining  F  by  eq.  (**)  yields  F  as  a  legitimate  c.d.f. 

n  t  ^1 

ever  R  .  If  is  a  given  copula  such  that  the  n 

iterate  of  <J>&  is  an  n-copula,  we  say  <J>&  is  an  n-copula 
by  abuse  of  notation. 

We  call  the  above  two  results,  Slclar's  Theorem. 

(1)  First,  note  that  if  V  and  W  are  any  0-1 

valued  random  variables  with  Pr(V  =  1)  -  x  and 

Pr(W  =  1)  =  y  ,  say,  0  <  x,y  £  1  ,  then  the  following 
bounds  occur:  for  the  possible  evaluation  of  the  joint 
probability  Pr(V  =  1  &  W  =  1)  and  Pr(V  =  1  or  W  =  1 )  . 

maxsum(x.y)  £  Pr(V  =  1  &  W  =  l)  £  min(x,y) 

max ( x , y )  <  Pr|V  =  1  or  W  =  1)  i  bndsum(x,y)  ,  (***) 

for  all  0  <  x,y  £  1  ,  the  upper  bound  occurring  for  max¬ 
imally  correlated  V  and  W  ,  while  the  lower  bound 
occurs  for  V  and  W  most  negatively  correlated.  Note 
that  the  case  of  V  and  W  being  statistically  indepen¬ 
dent  yields  Pr(V  =  1,  W=  1)  =x-y  ,  somewhere  in  the 
middle  of  the  two  bounds. 

(2)  Let  <p&  be  any  (2-)  copula.  Then 

maxsum ( x , y )  i  $&(x,y)  s  min(x,y)  ;  all  0  <  x,y  <  1  , 

with  the  boundaries  also  being  copulas,  since  we  can 
write,  for  all  0  5  x,y  £  1  , 

$&(x,y)  =  Pr(V'  <  x,  W*  <  x)  , 

for  some  uniform  [0-1]  r.v.'s  V',W'  over  R  .  Then 
define  0  -  1  r.v.'s  Vx,Wx  by: 

d  f  1  iff  V-  <  x 

x  to  iff  V 1  >  x 

if1  iff  W'Sx 

x  to  iff  w  >  x 

Hence , 

$&(x,y)  =  f,r(Vx  =  1.  wx  =  1)  , 
and  we  may  apply  eq .  (***)  . 

(d)  For  any  integer  n  >  1  define  as  the  class  of 

all  DeMorgan  n-copula  systems  for  =  1  ~  (’ )  •  Thus, 

for  any  such  system  for  n  =  2  is  easily  shown  to 

satisfy  the  relation 
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max(x,y)  s  4>or(x,y)  £  bndsum(x,y)  ;  all  0  5  x,y  £  1  , 

with  the  boundaries  also  achieved  by  <t>  being  the 

or 

DeMorgan  transform  of  copulas  min  and  maxsum. 

(e)  For  any  nil,  min  and  prod  are  n-copulas,  but 
maxsum  although  a  t-norm  is  not  an  n-copula  for  n  i  3  , 
noting  for  all  x1,...,xn  e  [0,1]  ,  the  evaluations 

min(x1 , . . . ,xn) 

prod(x1, . . . ,xn)  =  x1-*-xn  , 

maxsum(x1 , . . . ,xn)  =  max(x1+- • • +xn  -  (n-1),  0)  . 

Furthermore,  for  any  n-copula  ,  and  all  xlf...,x  e  [0,1]  , 


maxsum(x1 , . . . ,xn)  5  $&{x1 , . . . ,xn)  £  minfXj , . . . ,x  )  , 

noting  that  for 

X  =  (x1,...,xn)  s  y  = _ (y1j_...,yn)  e  [0,l]n  ,  the  variations 

over  the  n-interval  J(x,y)  are 


VarJ(x,y)(min)  *  max{ 0 , min( yx , . . . ,yn) ) -maxjXj , . . . , xn)  , 
VarJ(x,y)(prod)  =  (*1  ~  xl>--(Yn  -  V  ' 

and 

Var  (maxsum)  =  1  -  ^  , 

J<lin'in> 

and  hence  maxsum  is  not  an  n-copula  for  n  i  3  (but  it 
can  be  verified  to  be  a  copula,  i.e.,  2-copula)  .  Denote  the 
class  of  all  n-copulas  by  C 

n 


(f)  Consider  now  Yager's  family  of  t-norms  [275']  ,  $ 

«  /  P 

For  all  real  p  ,  1  <  p  <  +»  ,  each  .  is  also  an  n-copula 

for  any  nil,  since  generator  h  (t)  =  (l-t)p  as  a  function 

P 

of  t  e  [0,1]  yields  hp1(y)  =  1  -  y1/,p  ,  a  complete  monotone 

function  over  R+  .  (See  later  in  this  section  for  definitions 
and  properties  of  generators.) 

It  also  follows  easily  that  for  any  nil 


1  im+ 
P-1 


<P 


&, (p) 1 


( 1 ) (X1 ’ • 

•••v 

=  maxsurafXj , . 

■■'*») 

uniformly  in  all  x  =  (Xj, . . . ,x  )  «  [0,l]n  . 
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In  particular,  let  x  e  [0,l]n  be  arbitrary  and 
consider  maxsum(x)  : 

x, +• • • +x„  <  n  -  1  . 
l  n 


This  implies 

(1  -  Xj)+* • •+{!  -  xn)  >  1  . 


Hence,  there  is  a  p  >  1  such  that 


whence 


Vp1*! 


1  -  Xjl'V  • -  +  (1  -  xn)p  >  1  , 

n  1  . 

, x  )  =  1  -  (mint  z  (1  -  x.)p,  l))i/p 

n  J=1  J 

=  0  =  maxsum(x)  . 


In  any  case,  the  above  approximating  property  shows  that 
the  lower  bound  for  n-copulas  by  maxsum  cannot  be  improved 
upon.  Similarly,  for  any  n-copula 


max(x1 ,  .  .  .  ,xn)  <  <for(x1 . xn)  s  bndsum(x1 ,  .  .  .  ,xn)  , 

with  Yager's,  $or  as  p  1  +  uniformly  approximating  the 
upper  bound  bndsum  (the  latter,  not  an  n-copula  for  n  >  3)  . 


A  general  fuzzy  set  system  7  is  said  to  be  mutually  distribu¬ 
tive  iff 

=  <t>or(Vx.y)  (x'z)) 

<t>or(x.  ^&{y,z))  =  ^&^or^x'Y)  '  ♦ or(x'z) )  ' 
for  all  x,y,z  «  [0,1]  . 

A  general  fuzzy  set  system  7  is  / dempotent  iff 
<t»&(x,x)  =  <J>or(x,x)  =  x  ;  all  x  e  [0,1]  . 

A  DeMorgan  general  fuzzy  set  system  is  said  to  be 
semi -d i sir i but  I ve  iff,  for  all  x,y,z  e  [0,1]  , 

<P&(x,  1>or(y.*))  =  4>&(x,y)  +  4>&(x,z)  -  (J»&(x,y,z))  . 

Let  f  denote  the  class  of  all  semi-distributive  families. 


A  general  fuzzy  set 

system 

7 

is  said 

to  be  Archimedean  iff 

$&(x,x)  < 

X  ; 

for 

all 

o 

A 

<  1 

4,or(x,x)  > 

x  ; 

for 

all 

0  <  X 

<  1  . 

and 
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Denote  the  class  of  all  Archimedean  systems  by  at  . 

Note  that  FQ  =  (1  -  (•),  min,  max)  is  not  Archimedean.  The 

following  important  result  will  be  useful: 

For  any  x,y  €  [0,1]  and  any  t-norn  $&  ,  if 

<p  (x.x)  =  x  or  $&(y,y)  =  y  .  then  $&(x.y)  =  min(x.y)  . 


( Proof :  Suppose  x  <  y  . 

(1)  If  $&(x,x)  =  x  ,  then  since 

x  =  4>&(x,x)  s  <fi&(x,y)  S  <p&(x,l)  =  x  , 


the  result  follows. 


(2) 

If 

<}>&(Y.y)  =  Y  and 

4>&(X,X)  <  X 

(by  case  ( 1 )  )  , 

assume 

<t>&(x,y) 

<  x  . 

The  function  f  = 

$&(-  -Y)  is 

continuous  with 

fy(x)  < 

x  <  y 

=  fy(y)  •  Hence 

there  is  a 

z  ,  x  <  z  <  y 

with 

f  ( z )  = 

x  , 

Y 

X  >  <t>& 

( x , x )  =  <f>&(<t>&(z,y)  , 

y)  =  $&(z. 

<?&(y.Y) ) 

=  <f&(z,y)  =  x 

yields  a  contradiction  to  the  assumption. ) 

An  Archimedean  system  ?  is  strict  iff  and  <f*or  are 

2 

strictly  increasing  over  (0,1) 

A  DeMorgan  system  9  such  that 

<t>&(x,y)  +  <*>&(x,  1_y)  =  Y  :  all  X'Y  €  [0.1] 

is  called  exchangabi e  . 

A  general  fuzzy  set  system  5  such  that 

$orU.y)  =  x  +  y  -  <f>&(x,y)  ;  all  x,y  «  [0,1] 


is  called  Frankian.  (See  Frank  [61J  for  properties.)  Let  ?r  be 
the  class  of  all  Frankian  families.  Note  that  the  Frankian  property 
has  essentially  the  same  form  as  a  modular  form  or  valuation  for 
operators  <$>&  and  $Qr  over  [0,1]  .  (See,  e.g.,  Birkhoff  [17].) 

Yager's  family  of  DeMorgan  fuzzy  set  systems  ?  has  the  forms 
(7jpj  |15ps  +«}  ,  where 


(P) 


=  (1  -  (•  )  ,  4> 


&,(p)’  ^or , (p) 


) 


wi  th 


(p 


(x.y)  =  1  -  (min((l-x)p  +  (l-y)p,  l))1/p 


all  x.y  e 


[0,1] 


,  etc . 


Denote  this  class  by  v  . 
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Note  that 

?  ^  ^  j  =  (1  -  (•),  maxsum,  bndsum) 

?(+06)  =  (1  -  (•),  min,  max)  . 

Let  J  be  any  at  most  countable  (or  finite)  non-vacuous  index 
set.  Let  5  -  |  j  e  j)  be  any  collection  of  Archimedean  fuzzy 

set  systems  s  .  =  (<t>  .  ,  *  A  (^)  ,  with  <t>  .  fixed  (usually 

1  j  Tnot  T&  Tor  Tnot  1 

to  1  -  ( •  )  )  .  Let  mapping  p  :  J  -*  J  be  given  and  also  let 

K  =  {[a,,b.]  |  0  S  a.  <  b.  s  1  ,  with  all  closed  intervals  [a.,b.] 

1111  11 
for  distinct  j's  be  disjoint,  j  e  p(J)  ,  such  that  u  [a.,b.] 

jep(j)  3  J 

c  (proper)  [0,1]}  . 

Also,  for  any  i  =  [a,b]  ,  0  £  a  <  b  £  1  ,  define  the  affine 
operator  L  :  [0,1]  -*  [0,1]  and  its  inverse  L  1  ,  by,  for  all 


u  e  [0,1]  , 


L^ (u)  =  a  +  (b-a) • u 
L7-1(u)  =  (u-a) / (b-a)  . 
L7(L7_1(u))  =  L7_1(L,(u))  s  u  . 


Then  the  ordinal  sums  $&((J>&,  P>  K)  and  $or(Jor/  P,  K)  are 
defined  by,  for  any  x,y  €  [0,1]  , 

L  [<p  (:i)  (L ,_1(x)  ,L ,_1(y)  )  )  , 

7j  *  7j 


^ & '  p '  K) (x'u)  = 


*or($or'  p'  K>(X'Y)  = 


iff  x,y  e  f 
for  some  j  e  p(j)  ; 
min(x,y)  , 

iff  x,y  e  [0,1]  are  otherwise 

Ly  (<Pnr(j)  (Lr~1(x)  '  L/1(Y)})  ' 

j  °r  1  j 

iff  x,y  €  7  , 

for  some  j  e  p(J)  ; 

max(x,y)  iff  x,ye  [0,1]  are 
otherwise  , 


where 


y j  -  [3j ,bj]  ,  j  €  p( J)  , 


Z '  P’  K>  S  (♦not'  V$*'  p-  K)  '  V($or'  p'  K)) 
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is  called  an  ordinal  sum  of  systems  7  ,  with  0(7)  defined  as  the 
collection  of  all  possible  ordinal  sums  over  7  fixed.  Similar 
notation  may  be  used  for  only  the  t-norm  or  t-conorm  part. 


Note 

that 

P  . 

K)  is 

locally  Archimedean  over 

each 

Uj.bjl2. 

since 

P  . 

K) ( x , x) 

<  x  for  all  x 

e  [a.,b.]  , 

and 

otherwise 

P.  K) 

is 

locally 

a  min.  Similar 

remarks  hold 

for 

V($or'  P-  K> 

Theorem 

(i)  Given  any  7  ,  p  and  K  as  above,  7(7,  p,  K)  is  a 

well  defined  non-Archimedean  fuzzy  set  system  -  since  for  any 
x  e  [o, 1]  -  u  [a  . , b  .  ]  , 
jep(j)  J  ■> 

P-  K)(x,x)  =  min(x,x)  =  x  , 
torpor'  p'  K)  (x<x)  *  max(x.x)  =  x 

(ii)  Any  t-norm  is  either  min,  an  Archimedean  t-norm  or 

an  ordinal  sum  (of  t-norms).  Similar  results  hold  for  any  t-conorm 
being  either  max,  Archimedean  or  an  ordinal  sum. 

Proof :  Suppose  is  neither  Archimedean  nor  min.  Then  let 

M  =  (x  I  X  €  [0,l]&  <p&(x,x)  <  x}  . 

It  follows  from  the  continuity  and  nondecreasing  argument  properties 
of  $£  ,  that  we  can  write  w.l.o.g. 

M  «  U  ta.,b  ]  , 

j€j  J  J 

for  some  non  vacuous,  at  most  countable  index  set  J  with  [a^b^J’s 
all  disjoint,  a^  <  bj  ,  and  with  M  c  ( proper )[ 0 , 1 ]  ,  such  that 

=  x  for  x  =  a.  or  b. 

*&(x,x)  <  x  for  all  <  x  <  b^  , 

j  €  J  . 

Now  for  each  j  e  J  ,  and  all  x,y  €  [a^.b^]  ,  j  e  J  ,  we  have 

the  identity 

V*.y>  =  S  {L7~1(t)&(L7  (L/^X)),  L7  (L  “1(y)))) 

J  j  j  j  j  j 

=  L  (cf>  (j)(I,  -1(x),  L  _1(y)))  , 

J  j  J 
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(u,v) 


(<t>  (L  (u), 

j 


(V)))  ; 


for  all  u,v  €  [0,1]  ,  where  it  can  be  shown  that  <J>  ^  ^  is  a 

2 

legitimate  Archimedean  t-norm  (over  [0,1]  )  . 

Also,  for  any  (x,y)  e  [0,1]2  I-  u  [a.,b.]2  ,  if  x  =  y  ,  then 

jej  3  3 

<$>&(x,x)  =  x  ,  by  construction.  If  x  <  y  ,  say,  then  there  exists 
z  ,  x  <  z  <  y  such  that  <p&(  z,z)  =  z  ,  from  which  it  follows  (see 
eq.  (I))  that  4>&{x,z)  =  min(x,z)  which  implies, 

<P&(x,  y)  =  min(x,y)  . 

Thus,  for  all  x,y  e  [0,1]  , 


<f&(x.y)  =  id,  K)  , 

'  l ) 

$>&  =  (^^  'jej  *  id  =  P  :  J  -  3  i  as  given  above. 

Similar  results  hold  for  any  t-conorm  with  respect  to  max, 
Archimedeaness ,  or  being  an  ordinal  sum. 

Extend  the  definition  of  L  and  L  to  be  defined  over 

7  7 

[ 0  > 1 ] 2  .  where  for  any  V  e  [0,1]2  ,  letting  12  =  ,  and 

i  =  (a.b)  , 

L, (V)  S  ai2  +  (b-a)-V 
L,_1<V)  S  (l/(b-a))(V-al2)  , 

yielding  L, (L,-1 (V) )  =  L^-1  (L.,  (V) )  =  V 

Thus,  if  F  is  any  cumulative  probability  distribution  func- 

2  -1 

taon  (c.d.f.)  over  [0,1]  ,  F°L^  is  also  a  legitimate  c.d.f. 

2 

over  [0,1] 

Hence  if  is  a  family  of  associative  Archimedean  copulas  so 

that  for  any  j  «  p(J),  <p  ^ ^  a  copula  and  hence  cumulative  proba¬ 
bility  distribution  function  (with  uniform  marginals), 

2 

is  a  legitimate  c.d.f.  over  [0,1]  ,  noting  that  for  any 

V  =  e  [0,1]2  ,  the  special  cases 


110 


Goodman  and  Nguyen 


Fj(V)  = 


V2  -  3J 
bj  " 


V1  ~  3j 


ifV€j  ={V|xSa.  or  y  £  a.) 

^  •  3  J  J 

if  V  «  Jj  j  *  (V  |  x  1  ^  and  a^  s  y  s  b  ^ } 


iff  V  e  J3  j  =  {V  |  y  i  and  a^  s  x  s  b^} 

if  V  e  J  .  =  {V  |  x  £  b.  and  y  i  b.)  . 

’  •  J  J  3 


Suppose  w.l.o.g. 

0  s  <  b1  <  a2  <  b2  <  • • ■  s  1  . 

Define  intervals 

T  j  =  tbj-l'  aj3  '  J  =  1<2'  ’  ' ' 

where  b  =  0  ,  if  >  0  ,  etc. 

Also,  denote,  as  before,  min  to  be  the  t-norm  representing 
mini-mum,  noting  that  min  is  a  legitimate  c.d.f.  over  [0,1] 
representing  the  distribution  of  a  point  uniformly  distributed  over 

2 

the  main  diagonal  of  [0,1]  .  Indeed,  min  is  a  non-Archimedean 

copula.  Then,  for  j  =  1,2,3,...  , 

G.  =  min°L  -1 

3  j 

2 

is  also  a  legitimate  c.d.f.  over  [0,1]  ,  where 

x  -  b,1  y  -  b 

G.(V)  =  max ( 0 ,  min(- - c -  - £ -  ,  1)) 

J  3j  "  J-l  3j  "  bj-l 

with  the  special  cases 


Qj(V)  = 


v2-bj-i 


arbj-l 


vrbj-i 


aj-bj-i 


if  V  €  K1(j  i  {V  I  x:  s  bJ_1  or  y  s  b  > 

if  V  e  K  =  {V  |  x  ^  a  and  b.  <  y  <  a.; 

*• '  3  3  3~1  3 


if  V  e  K  .  =  [V  |  y  i  a.  and  b.  <  x  <  a.) 

0  <  J  J  J~ 1  3 

if  V  e  K  .  =  {V  |  x  £  a.  and  y  i  a.) 

4  /  J  J  J 


It  readily  follows  that  all  i  * ,  t-..2,  J2  y  ^ ,  K2  y  K3  . , 

j  =  1,2,3,...  are  essentially  disjoint  (except  at  boundaries  -  see 
diagram  below) . 
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By  using  these  properties,  it  can  be  verified  directly  that  any 
ordinal  sum  of  associative  copulas  is  also  an  at  most  countable 

o 

probability  mixture  of  c.d.f.'s  over  [0,1]  : 

$(<}>,  p,  K)  =  z  (bi-a .)  ’F.  +  z  (a.-b.  )-G  , 

j2l  J  J  J  j^i  J  J  J 

where 

*  ♦s(JI°Lr‘  • 

(  J  }  (x,y)  =  h  j-1  ( min(h^  { x)  +  h^y),  hj(0)))  , 

where  hj  :  [0,1]  -*  [0,+»]  is  monotone  (strict)  decreasing  convex 

with  hj(l)  =  0  ,  0  <  hj(O)  ^  +«•  »  and  is  as  before. 

Now 

(i)  For  all  V  =  [*]  with 

V  e  M.  =  {V  |  V  e  [0,1]2  &  hj (x)  +  h .  ( y )  2  h(0) }  , 

J  j  1 

<52<f&(j)(x,y)  _  520  _  ^ 

iJx5y  ~  <5xtfy 

(ii)  For  all  V  =  with 

V  «  Nj  3  (V  |  V  e  [  0 , 1  ] 2  &  hj(x)  *•  hj  ( y )  S  h(  0 )  >  , 

j)  (x.y)  d2h  “X(h  (x)  +  h  .  ( y ) ) 

0  i  _ -  =  - d _ d _ d _ 

tfxtfy  tfxtfy 


-Dh  (xJ-Dhjty)*  (D2hj)  (<$)&(  J)  (x,y)  ) 
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=  (D2^-1)  (hj(x)  +  hj  {y )  )  •  Dhj  (x)  •  Dh  ^  (y) 
=  q  (hj )  (x, y) 

s  D2hj(<}>&(  j)  (x,y)  )/|Dhj(4>&(jl  (x.y)  )  | 

=  *  (h..)  (x.y)  , 

by  using  the  fact  that  always 

<t>&(^{x(y)  £  min(x,y) 


and  the  convexity  of  h^  . 

Hence,  for  all  V  =  e  [0,1]2  , 


<52F  (x.y) 

-  _ 1 _  < 

tfxtf  y 


■<7  ( h  . )  (L  1  ( V)  )  *(h,)(L  -1  ( V)  ) 

- 3 - -j — 5 —  S  - 3 - Ij— —  . 

,bJ  -  V  lb3  -  V 
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and 

<}>&  +«(X'Y)  =  max(0,  x  +  y  -  1)  =  maxsum(x,y) 
all  x , y  «  [0,1]  , 

and  clearly  h  [0,1]  -♦  [0,+°>]  is  nonincreasing  convex  with 

s 

hg(i)  =  o  ,  hg(0)  =  +«  , 


Dhs(x)  =  -log  s-  — 


=  -log  s-  ( 1  + 


s  -  1 


sx  -  1 


D2hg(x)  «  (log  s)2-  — 


.X 


(s  -  1  ) 

Z  0  . 

Note  also  that  since 


.  2 


=  (log  s )  2 •  (  -  * — -  + 


x  ,  .  x  . .  2 

s  -  1  (s  -  1) 


-log f-(~ — ~ — —  (ST'~-1--]  =  h  (x)  +  h  (y) 
1  (s  -  l)2  J  s  s 


=  hs(<»,«,s(x'y,) 


=  -log 


*&,s(x'y) 


-  1 


l  S  -  1 


<?>*  v  v 

S  -  1  =  (sx  -  l)(sy  -  l)/(s  -  1)  :  all  x,y  e  [0,1] 

Thus,  for  all  x,y  e  [0,1]  , 


(D2hg)  (<J>&  s(x,y))  =  (log  s)2- 


Dhs(Vs(x'y)  )  =  ^  -log  s) 


s  “  1 


(s  -  ir 


l  (s'*  -  1)  (s'  -  1)  (s~  -  l)2(sy  -  l)2 
s  -  1 


1  + 


(Sx  -  l)(sy  -  1) 


and  thus 


n (hs) (x,y) 


- ( log  s) 


fl-  1 

fl+  i-1 

S  -  1  ,  (s  -  l]2  1 

l  sx-lj 

sy-lj 

,(SX-  l)(sy  -  1)  (sx-l ) 2 (sy-l ) 2 . 

(- 


log  s)3-  f  1+ - ^ - - - 

(Sx-l)(sy  -  1,J 


(log  s)-(s  -  1 ) • ( s  -  1  +  l)(sY  -1+1) 
( (sX  -  1) (sY  -  1)  +  (s  -  l}2 
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_  ( log  s ) ( s 


.x+y 


n-sx*y 

sy  +  s) 


2 


5  log  s- (s  -  1) 


=  log  s- (s  -  1 ) 


sup  (u  +  1 ) ( V  +  1 ) 

(Oiu, vis-1)  (u-v+  (s  -  1)) 

1 


2  ' 


(S  -  1) 


(s  -  1) 


If  s  <  l  ,  occurring  for 
u  =  v  =  0  or 
u  =  v  =  s  -  l 

if  s  i  i  ,  occurring  for 

u  =  0 ,  v  =  s  -  l 
u=s-l,  v=0 


_  log  s  , 

-  ~-g-  1  max(s ,  1 )  , 

by  simple  inspection. 

Note  also,  for  example, 


*<i>&iSU,y) 


Dhg(y) 


, s ( x  <  Y ) ) 


y  s 
s 1  + 


sx  -i 


noting  for  all  s  *  0  ,  z  0  .  x,y  e  [0(1]  . 


sx  -  1 


Thus,  in  turn,  for  all  s  i.  o 


3  s(x'Y)  ,«=  Y  1 

* ' s  _  _  (s-s  )s 1  log  s 


<?2y 


sy  +  £  -a 

sx  -  1 


for  all  x , y  e  [0,1]  . 

Hence,  for  all  x,y  €  [0,1] 


*  » &,six >y) 
<5y 


s-(sx  -  1 )  ,  if  1  <  s 


S  (s  -  1) 
X 


x 

sx  -  1 

S  -  1 

X 


if  0  <  s  <  i 
if  s  =  1  , 


Simi larly , 


d  Vs(X'y) 


<52y 


s  i2i*  .  .  „  . 


Note,  finally,  the  evaluation 
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a  Vslxl . Xk) 


axl . axk 


Xl=. .=xk  =  0 


,  k-i 


flog  si" 

Is  -  lj  ,  0  <  s  < 


+«  ,  S  *  1 


1 

0 


,  S  =  1 


,  if  S  =  -H» 
for  k  2  2 


For  Yager's  Family  <j> 


0  S 


0  < 


aVP(x'y) 

<5y 


&.  (P) 
1 


'•  t  f  1  -  xlpl  (p-l)/p 

U  -  yj 


,  all  x,y  e  [0,1]  , 


3  - -  (  ( 1  -  x )  /  ( 1  -  y)  )p+1 


&.  (p) 

3*  y 


1  x  (l  +  {  !  1  -  x)/(  1  -  y))p]2“(1//p) 


all  x.y  «  [0,1]  . 

a2<^&  p<x-y>  p  -  i 

For  1  <  p  <  2  ,  - - -  is  the  product  of  -  and  a 

<5  y 

1  -  V 

nondecreasing  function  in  -j — — —  with  limit  as  y  -*  1  being  +«>  . 


a  V(p)(x'y) 


<52y 


a  0  ;  for  p  -  1  , 


a  *&, (p) (x'y)  <  p  -  1  (P  +  l)1+(1/p)- (p  _  2)l-(2/p) 


02y 


for  2  5  p  <  . 


1  -  x 


Up  -  1) 


2- (  1/p ) 


Also,  for  all  n  2  1  ,  p  2  l  ,  all  Xj,...,x  «  [0-1] 

probability  density  function,  when  non-zero  is 


o  <; 


a2^,(p)(xl'-'-,xn) 

<?x,  •  •  •  Ox 
1  n 


the 


1- (P-1)- (2p-l)- •  ((n-l)-p-l)- (l-x^P  1--(l-xn)p  1 
(  d-x1)p+-" •  •  +  (i-xn)p)n"(  1/p) 

1  •  (p-1)- (2p-l)- • (n-l)-p  -  1)  •  n~(n_(1/p,)  . 


Bellman-Giertz  [14]  and  related  theorems 

(i)  Bellman-Giertz  theorem  extended: 

Given  any  general  fuzzy  set  system  7,7  is  mutually  dis¬ 
tributive  iff  7  is  idempotent  iff  7  =  7Q  .  (See  [14].) 


116 


Goodman  and  Nguyen 


(ii)  From  Goodman  [91],  we  have; 

Given  any  general  fuzzy  set  system  5  ,  ?  is  exchangable 

with,  for  all  x,y  e  [0,1] 


<54>&(x,y) 


<5y 


existing , 


y=0 


iff 


5=5, 


(iii)  Given  any  general  fuzzy  set  system  5,5  is  semidistri¬ 
butive  iff,  for  all  finite  index  sets  JQ  ,  and  all  x.  e  [0,1] 


->  *  J0  ' 


and 


,  ,  ,  _  ,  , \ card  k+1  ,  ,  . 

^or  (xj)  =  2  (_1)  (xi)  ' 

jej  0  jeK 


K  (*>  -  r  (-l)card  k+1-4.s  (x,)  , 
WKJ0  j,K 


noting  the  similarity  of  the  above  relations  with  the  alternating 
(Poincare)  sum  expansions  of  probabilities  of  unions  of  events  in 
terms  of  intersections  and  vice-versa. 


Basic  canonical  expansion  theorems 

Ling  [159]  and  others  (see  [226],  [140])  have  obtained  the  fol¬ 

lowing  : 

Given  any  general  fuzzy  set  system  5  , 

(a)  5  is  Archimedean  iff  there  are  monotone  decreasing  continuous 

functions 

h ,  h  •  :  [0,1]  [  0  .  +®  ] 

with  h(l)  =  h'(l)  =  0  ;  h(l),  h'(lj  S  +<»  ,  called  generators,  such 
that  for  all  xlfx2,...,x  e  [0,1],  nil, 

-1  n 

. x  )  =  h  (min{  z  h(x.),  h  ( 0 )  )  )  . 

X  j  =  1  J 

-1  n 

<f'or(x1 . xn)  =  1  -  h*  (min{  Z  h'(l-Xj),  h'(0)))  . 

h,h'  are  unique  up  to  positive  multiplicative  constants. 

Conversely,  any  choice  of  such  h,h'  as  above  generate  an 
Archimedean  system  5  . 

5  is  strict  iff  h(0)  =  h'(0)  =  ,  and  this  the  min  operator 

above  may  be  omitted.  Also,  iff  5  is  DeMorgan  (for  $  =  l-(*))  . 

h  *  h '  . 
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(b)  <f>  is  involutive,  continuous,  and 


<5>not(x)  -  4>not(Y)  sy-x;  1  S:  y  >  x  >  0  , 


iff  $not  =!-(•>  • 


(c)  For  any  involutive  <6  .  ,  there  is  a  monotone  increasing 

not 

continuous  function  V  :  [0,1]  -»  R+  ,  with  V(0)  =  0  such  that  for 

all  x  €  [0,1]  , 

<t>not(x;  =  v’1<v<1)  "  V(x)  >  • 

( d )  Moynihan's  theorem  [2261. 

Given  any  DeMorgan  fuzzy  set  system  for 
♦not  =!-<•), 


■<P&(x.Y)  -  <|>&(x,z) 

f  e  C  iff  0  < 

2  <f»  ( x ,  y )  -  $  { x ,  z ) 

[”or  1  Tor 


^  z  -  y  ;  for  all 


x,y,z  «  [0,1]  with  y  i  z  . 

(e)  Given  any  general  fuzzy  set  system  ?  ;  5  e  n  Ar 

iff  the  monotone  generator  h  is  convex.  The  nth  iteration 

is  an  n-copula  system  iff  h  is  convex  and  h  1  is  a  completely 
monotone  function  up  to  degree  n  ,  i.e., 

(-i)J-d-’h  1(x)/d-*x  Z  0  ,  for  j  =  0,1,2 . n  ;  all  x  . 


( f )  Frank's  theorem  (Frank,  [61]) 

Jr  =  (Jrnjtr)  u  u  o(Jrnxr)  , 

Jr  n  at  =  (7  I  0  <  s  s  +<=}  s  m  (for  *  .si¬ 
s'  ’  Tnot 

%  a  11  -  >■»  •  V.-  V.s'  • 

and  for  all  x,,...,x  e  [0,1]  ;  0  <  s  3  +»  , 

1  n 

n  x  . 

<P&  ,s(Xj . xn)  =  log  ( 1  +  n  ( s  3  -  l)/(s  -  l)n  1)/log  s 


=  h  (  r  h  (x  ) )  , 
j  =  i  5  3 

*or,s(xi . V  =  1  "  Vs'1  -  xl"--'1  -  V 


=  1  -  h  "(  r  h  (  1  -  x.)  )  ,  etc.  , 

j  =  l  S  J 
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where  generator  hg  is  given  by 


hg(x)  =  -log((xx  -  1 )  / ( s  -  1)) 


and  where  the  cases  s  =  0  ,  1,  +<*>  are  derived  by  limit  consid¬ 
erations  (?Q,  5 1  *  already  having  been  introduced).  In 

particular,  although  hQ  does  not  exist  as  an  ordinary  function, 
for  all  x  e  [0,1] 

hj (x)  =  -log { x)  , 
h+~(x>  "  1  "  x 

(See  Frank  [61],  Goodman  [91].)  Also, 

S  £  cn  ;  for  all  n  2  2  , 

*  =  (*0.  a  o(9qi  9x) 
s  *r  £  c2  , 

<V  V  s  <*.  !  0  s  s  5  1>  S  C„  =  C  2 


for  all  n  1  2  , 


for  n  k  3  , 


»+-  «  c2  -  c 
■9  -  £  cn  n  jir  , 

5  ,  €  C  -  . 

+«>  2 


For  all  n  >  2  , 


e  =  (?n)  0  (e  n*r)  0  o(c  nir)  . 


n 


Let  ?  =  (<p 

not ' 

<3 

•e- 

,  $ 
Tor 

)  be 

any  fuzzy  set  system.  Then 

f  j  (x 

.y) 

i3 

■e- 

VI 

(x,y) 

£  min(x,y)  , 

max(x 

.Y) 

s  4>or(x,y) 

<  f  2  ( x ,  y )  , 

for  all  0  3  x,y 

<  1 

where 

f  and  f  is  a  non-cont i nuou s  t-norm 

and  t-conorm,  respectively,  ’ 

which 

are  in  a  DeMorgan  relationship  for 

*not  =  1  -  <*)  ' 

with 

X 

iff 

Y  = 

1 

)  fl  : 

Y 

iff 

x  = 

1 

.  0 

iff 

0  * 

x ,  y  <  1 

X 

iff 

y  = 

0 

f  2 ( x  *  y 

)  3 

Y 

iff 

X  = 

0 

.  0 

iff 

0  < 

x,y  <  1 
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for 


Note  that  for  any  e  U  <  1  i  p  s  +® 

*&.(p)  and  ^or ,  { p)  may  be  written 


the  generator  h,  . 

(p) 


h(p)(X)  =  (1  -  x)p  ;  all  x  cf»  [0,1]  . 


Extend  now  Yager's  family  to  include  all 

0  <  p  <  1  ,  with  generator  h(p)  given  above. 

J/ni  <  we  can  compare  d>  ,  „  and  * 

(P}  V&.(P)  n  9or,(p) 

bounds  for  copulas  (2):  Since  for  any 


’<£.)  f°r 
Then  for  any  such 

with  the  previous 


1  £  x,y  >  0  ;  0  <  p  <  l  , 


(XP  +  yP)l/P 


>  x 


y  . 


it  follows  that 


<f&,(p)(x,Y)  5  maxsum(x,y)  ;  all  x,y  e  [0,1] 

with  strict  inequality  holding  for  all  x,y  e  [0,1]  such  that 

(1  -  x)p  +  1  -  y)p  <  i  .  (A] 

Similarly,  by  taking  the  DeMorgan  transform, 

V,  (p)  (x'Y>  ~  bndsum(x.y)  ;  all  x,y  e  [o,i]  , 

for  0  <  p  <  i  ,  with  strict  inequality  holding  for  all  x,v  as 
above  in  eq,  (A). 

Moreover,  it  easily  follows  that 


t&,(p)(X'Y)  =  fl(x'Y> 


and 


1  im 

p~»0 


yor , (p) 


(x,y) 


f2(x,y)  , 


for  all  x,y  «  [0,1]  ,  so  that  these  bound  relations 
eral  be  improved,  noting  <5(p)  I  0  <  p  <  1}  s  Ar 

<7(p)  1  0  <  P  <  1)  n  c2  =  0  . 


cannot  in  gen- 
but  certainly 


Min  and  max  can  be  considered  as  limiting  cases 
medean  t-norms  and  t-conorms: 

Consider  Yager's  DeMorgan  Archimedean  families  of 
t-conorms 


of  Archi- 
t-norms  and 


Vpui 


■V 


n 

(min(  z  h  (x)  )  , 
j  =  l  p  J 


1)  ) 
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*or,p<xl . xn>  =  1  '  ^&,P(1  "  X1 . 1  ‘  Xn) 


=  1  -  h  (min(  t  h  (x.),  1))  , 

P  3=1  p  J 

xl*  * • ■ ,xn  €  t0<1l  : 

h,  ,(x)  =  (1  -  x)p  ,  for  0Sxsi:p>0. 

(P) 

Then  It  can  be  shown  although  min,  max  are  non- 
Archimedean  and  not  in  the  family, 

minfXj, - xn)2:<t»&  (p)  . xn’>  *  min(x1 ,  .  .  .  ,xn)  -  (n1/p  -  1) 

max(x1(...xn)  +(n1/p-  1)  k  <t»or,(p)(xl . V  2  max{xl . V 


uniformly,  for  all  xi,'",'xn  «  [0,1]  . 

On  the  other  hand,  Frank's  family  g  ,  <{>or(S  as  s  -»  0  , 

approaches  min,  max,  respectively,  non-unif ormly . 

For  example, 


<Vs(x'Y) 


=  min(x,y) 

-log ( 1 — s ) 
log  s 


log(  1 


,|x-y|  _  max(x,y) 


( l-min(x,y) ) 


noting  the  critical  s^x  term,  e.g. 

Similarly,  recalling  that  roaxsum  is  not  a  copula  (but  is  a 
t-norm)  for  n  k  3  and  bndsum  is  not  a  cc-copula  (but  is  a  t-conorm) 
for  n  >  3  ,  and  for  all  +•  i  p  >  1  ,  <|>&)  is  a  copula  <t>&1  = 

maxsum  ,  and  <|>or  j  j  is  a  co-copula  ,  <j>or  ^  =  bndsum: 

P-1 

(1  -  rnaxfx^ , . . . ,x  ) )  P  •  maxsum(Xj , . . . ,xn) 

P-1 

+  1  -  (1  -  maxfXj . xn)}  p 

>  (x,,.,.x  )  £  maxsum  ( x,  .....  x  )  , 

& ,  p  1  n  l  n 

bndsum(x1 . xn)  2  ,  (p)  (xl . V 

P-1 

i  (1  -  max(Xj, . . . ,xn) )  p  • bndsumtXj , . . . ,xn)  , 


uniformly  for  all  xlf...,x  «  [0,1]  . 

Thus  maxsum  and  bndsum  are  uniform  limiting  cases  of  <p&  and 

respectively,  as  p  -•  1+  , 


or  ,p  ' 
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For  further  investigations  into  the  definitions  and  relations 
for  various  fuzzy  set  operations  see  Dubois  and  Prade  [51].  See 
also:  [55],  [160],  [199],  [200]  for  further  discussions  on  nega¬ 
tion;  [65],  [10],  [112],  [115]  for  implication  operator  modeling; 
and  [275]  and  [50]  for  interesting  general  discussions  of  how  fuzzy 
set  operators  should  be  defined.  In  addition,  again  see  [4],  [911, 
[140],  [226]  for  background  and  other  references.  See  also  the 
recent  comprehensive  paper  [44']. 


Appendix.  Pseudo-Inverses  of  Distribution  Functions  and  Bounds  on 
Probabilities  of  Events. 

Let  F  be  continuous  from  the  right  and  non-decreasing  over 
ir  .  For  any  x  e  s?  ,  define 

x  =  inf{t  |  t  €  ir  and  F(t)  =  F(x)} 

=  inf  F  1 (F(x) )  £  x  . 

Similarly  define  for  all  y  e  [0,1]  , 

Y  *  PF( y)  =  inf {z  |  y  s  z  e  rng(F)}  . 


By  the  right  continuity  of  F  it  follows  that  for  all  x  e  IR 
and  y  €  [0,1]  , 

*  *  * 

x  =  x  , 


F ( x)  =  F(x  )  , 

pf(pf(y) )  =  pf( y)  • 


Define  for  all  y  e  [0,1] 

F* (y)  =  inf  F_1(pF(y) ) 

=  inf  F_1([pF(y),  1]) 

=  inf (F-1 [y, 1 ] )  , 

=  F*(PF(y>)  . 

In  particular,  when  y  <=  rng  (F)  , 

* 

y  =  PF(y)  =  F ( x )  =  F(x  )  ,  for  some  x  e  R 

and 


F*  <y)  =  F*  (PF(y) )  =  F* ( F ( x ) ) 


* 


X 
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Thus,  for  all  x  e  R  , 

F*  ( F  (x)  )  s  F*  ( F ( x*  )  )  =  x*  . 

Hence  for  all  y  e  [0,1]  ,  since  pp( y)  e  rng(F)  ,  there  is  a 

* 

unique  x  e  R  such  that 

* 

F(x  )  =  PF(y) 

and  hence 

F(F* (y)  =  FF*(pF(y))  =  F { F*  F (x* ) )  =  F(x*)  =  pp  ( y)  . 

Furthermore,  for  all  x€R,ye[0,l], 

F  ( F* ( F ( x) ) )  =  FF*  F{ x* )  =  F(x*)  =  F(x) 

F* (F(F* (y ) ) )  =  F*(FF*PF(y))  =  F*{pp{y))  =  F* (y)  . 

Thus,  F*  is  an  actual  pseudo-inverse  operator. 

Also,  the  above  implies,  for  all  x  e  r  ,  y  e  [0,1]  , 

Pf(F(x))  =  pp(F(x*))  =  FF*  F (x* )  =  F (x* )  =  F(x) 

Lemma  1 

For  any  x  e  r  ,  ye  [0,1]  , 

F* (y)  s  x  iff  y  s  F(x)  iff  x  e  f  1[y,l] 

i  .  e .  , 

[ F* (y) ,  +•)  =  F-1 [y, 1 ]  . 

Proof :  (i)  If  F*(y)  s  x  ,  then 

ys  PF(y)  =  F ( F* (y } )  S  F(x) 

( ii )  If  y  s  F(x)  ,  then 

F* (y)  S  F*F(x)  =  x  s  x  . 


Application  1: 

U  be  distributed  uniformly  on  [0,1]  and  r.v.  X  have 
over  R  .  Then  F*®(U)  and  X  are  identically  distri- 


Let 
c.d.f.  F 
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buted. 

Lemma  2 

For  any  x  «  R  ,  ye  [0,1]  , 
x‘  £  F*  (y)  =  F* (PF( y) ) 

iff 

F(x)  =  F(x*)  £  pF(y)  . 

Proof:  Use  FF* (pF(y) )  =  Pp(y)  and  F*F(x*)  =x* 


Application  2 

Let  X  be  any  r.v.  having  c.d.f. 
x  €  R  ,  y  €  [0,1]  : 

(i)  Up  to  zero  probability, 

X  £  x  iff  X*  £  x* 

iff  X*  £  F* (F(x) ) 
iff  F ( X)  £  Pf(F(x) ) 

iff  F(X)  £  F(X)  . 
(ii)  F(X)  £  y  iff  F{X)  £  p  ( y) 

iff  X*  £  F* (y) 
iff  X  £  F* (y)  . 

(iii)  Pr(F(X)  £  y)  =  Pr(X  £  F*(y)) 

*  F ( F* (y) ) 

*  *»f(y)  • 

Hence,  the  c.d.f.  of  F{X)  is  p„  . 


F  over  R 


Then  for  all 


Application  3 


Let 


be  any  c.d.f.  over  Rn  with  marginal  c.d.f’s 


corresponding  to  r.v.  Xjf  j  =  l . n  .  Then  for  all 

V  v  <•  IP 


F  . 
1 


X1 ' ' ' " ,Xn  *  R  ' 


n 


F<xl . V  -  Pr{  &  (X.  £  x.)) 

j  =  l  J  J 

=  Pr{  &  (X,  £  F  (F{x  ) ) ) ) 
j  =  l  3  3 
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=  F(F 
=  G  ( F 


1 

1 


,{W . Fn* 

<xi> . W> 


6(Ya . Yn)  -  P(P*(y1) . Fn*(yn)>  ;  all  Yj  €  [0,1]  • 

Clearly  G  Is  the  joint  distribution  of  n  unif  [0,1]  r.v.'s 
^l'’.*'Un‘  A^so  marginally,  the  above  results  imply  each 

Xj  «  has  <=.d.f.  Fj  ,  j  =  1 . n  . 

Suppose  dually  that  F  is  a  continuous  from  the  left  nonde¬ 
creasing  function  over  R  .  For  any  x  e  R  ,  define 

F(x)  ^  sup  {t  |  teR&F(t)Sx}  . 


Lemma  3 

For  any  x  e  R  ,  ye  [0,1] 

?(y)  2  x  iff  y  2  F(x)  . 


Proof:  (i)  If  ?(y)  2  x  ,  then  immediately 

y  2  F(?(y) )  2  F{x) 

(ii)  If  y  2  F(x)  ,  then  using  the  definition  of, 
?(y)  *  ?(F(X))  2  x  . 

Lemma  4  (Bounds  on  the  Joint  Probability  of  Events) 

Let  (X,a,Pr)  be  a  probability  space.  Then,  for  any 

n  2  1  and  A, , . . . , A  e  a  , 
l  n 

(i)  maxsum(Pr(A1) , . . . ,Pr(An) )  s  Pr (A^n- -nAn) 

s  min(Pr ( A: ) - ,Pr(An) )  » 

max(Pr(A1) , . . . ,Pr(An) )  £  Pr(Axu- -UAn) 

£  bndsum(Pr(A1) , . . . ,Pr (An) )  , 

where 

maxsum(a1 , . . . ,an)  3  max(a1+--+an  -  (n  -  1),  0) 

bndsumfa. , . .  .  ,a  )  =  min(a,+--+a  ,  1)  . 
in  in 

(ii)  Pr(A1n-.nAn)  =min(Pr(A1) - ,Pr(An>)  , 

iff  for  any  i,  1  £  1  <  n  such  that 

Pr(Ai)  =  min(Pr (Aj ) , . . . ,Pr(An)  , 

Pr( A,  £  n  a.)  =  1  , 
lsjsn  3 

3*1 
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i.e.,  there  is  a  minimal  set  by  set-inclusion. 

In  particular,  if  Aj  *  • *  =  A  ,  (ii)  is  clearly 

satisfied . 

liil)  Pr(A1u--UAn)  =  max(Pr (A: ) , . . . ,Pr (An) ) 

iff  for  any  i  ,  1  £  i  £  n  such  that 

PrfA^)  =  max(Pr(A1) , . . . ,Pr (An) ) 

Pr ( A .  2  u  A.)  =  1  , 

1  1£ j£n  3 

j*i 

i.e.,  there  is  a  maximal  set  by  set-inclusion. 

In  particular,  if  Aj  «  •  •  =  A  ,  (iii)  is  clearly 

satisfied . 

(iv)  Pr(Ain--nAn)  =  maxsum(Pr (A1 ) , . . . ,Pr (An) ) 
iff  either 

PrU^-.riAj^)  =  0  or  Pr(Ai  u  A.  =  X)  =  1  , 

for  all  i,j;l<i,j<n;i*j, 

i.e.,  all  Aj ,  ....  A^  are  pairwise  exhaustive. 

In  particular,  if  Aj,  A2<  ...,  An  are  all  pairwise 

disjoint,  (iv)  is  clearly  satisfied. 

(v)  Pr(A1u--UAn)  =  bndsum(Pr (A1 ) , . . . ,Pr (A^) ) 

iff  either 

Pr(A1u- -UAn)  =  1  ,  i.e.  , 

Pr(A1u-.UAn  =  X)  =  1  or  Pr(Ai  n  A.  =  0)  =  1  , 

for  all  i,j  ;  1  £  i,j  s  n  ,  i  *  j  ,  i.e.,  all  A1(...,An  are 
pairwise  disjoint. 

In  particular,  if  AJ(A2,...,An  are  all  pairwise  disjoint 
(v)  is  satisfied. 

Proof  of  Leau»a : 

Results  (iii)  and  (v)  follow  by  use  of  DeMorgan  transforms  upon 
arbitrary  . 

For  n  =  2  ,  simply  use  the  modular  expansion 

Pr(AUB)  =  Pr ( A)  +  Pr(B)  -  Pr(ATiB). 

For  n  £  3  ,  use  induction.  For  example,  for  (iv)  for 
n  =  3  ,  if  Pr(A^  n  a2  n  a3 )  >  0  ,  consider 

Pr(Ax  n  A2  n  A3)  =  max(Pr(A1)  +  Pr(A2)  +  Pr(A3)  -2,  0) 

iff,  by  using  the  case  for  n  = 


2 
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iff 


iff 


Pr(A1  n  (A2da3))  =  Pr(A1)  +  Pr(A2nA3)  - 
Pr(A,nA,)  =  Pr(A,)  +  Pr(A„)  -  1 

4*  sJ  4.  w 


1 


Pr(Ax  U  (A2nA3)  =  X)  =  1 
Pr(A2UA3  =  X)  =  1 


'  Pr ( (A1UA2)  n  ( A^UAg )  =  X)  =  1 
Pr(A2UA3  =  X)  =  1 

iff  Pr ( AAJ A j  =  X)  =  1  ,  i  *  j  ,  i(j  =  i,  2,  3  . 

See  also  the  very  general  bounds  for  the  probability  measure  of 
any  wff  in  PL  developed  by  Hailperin  ([102'],  pp.  206-208), 
extending  his  and  Frechet 1 s  earlier  work  on  probability  bounds.  See 
also  section  2.3.9  (A). 


2-3.7  Functional  extensions  and  embeddings 

Extension  or  lifting  procedures  are  done  to  enlargen  the  domain 
of  point  functions  to  set  functions,  and  further,  to  generalize  set- 
functions.  Embedding  here  is  restricted  to  the  identification  of 
points  with  certain  dispersions  or  equivalently  generalized  sets: 
namely  dirac-like  forms.  Generalizations  of  extension  procedures 
and  embeddings  have  been  particularly  developed  by  Manes  [169].  The 
connections  between  the  fuzzy  theories  of  Manes  and  general  logical 
systems  will  be  established  in  Chapter  7. 

Here  we  present  essentially  two  natural  approaches  to  function 
extensions  and  discuss  briefly  some  connections  between  the  two. 

Let  X  and  Y  be  arbitrary  non-vacuous  sets  and  f  :  X  -*  Y  a 
fixed  function.  Consider  then  the  following  extension  and  inverse 
operations : 

f"1  :  »(Y)  -  *{X)  , 
where  for  any  C  e  ?(Y)  , 

f_1(C)  =  {x  |  x  e  X  &  f{x)  e  C}  =  U  f —  1  ( y ) 

y«c 

and  where  similar  notation 

f-1  ;  Y  *(v; 
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is  used  for 


(f-1  I  <<Y>  I  Y  «  V>) 


Also , 


7  :  9(X)  -  *(Y)  , 


where  for  any  B  e  9(X) 


7(B)  A  (f (x)  |  x  e  B}  . 


Similarly,  definitions  hold  for 


7  1  :  r?  (Y)  99(X)  , 

7-1  :  »  (Y)  99  (X)  , 


and 


f  :  *9  (X)  -♦  99  (Y)  . 


Standard  relations  which  will  be  useful  include: 

For  any  *  e  aa(X)  and  C  e  a(Y)  ,  if  :  0  ^  f  1 (C)  e  a  ,  then 

C  n  rng  f  «  7(a)  , 


f  1 (C)  €  a 

iff 

(SB) (B  €  a 

&  B  =  f-1 ( C ) )  , 

C  «  f(a) 

iff 

(SB)  ( B  €  a 

&  C  =  f (B) ) 

iff 

(SB) (B  €  a 

&  B  e  7-1(C) ) 

iff 

or  (B  « 

(BeT-1 (C) ) 

i  a  n  I-1 (C) ) 

iff 

{B)  -1 
b«7  ( c ) 

«  t  (an  7~1 (C) ) 

( Be?-1 ( C ) ) 

Similar  results  hold  at  the  point  level  for  C  replaced  by 
y  e  Y  ,  a  by  Be  ?(X)  ,  etc. 

Let  Hq  be  a  fixed  Heyting  algebra  and  recall  the  category 

theory  notation  SET  .  (See  section  2.4.2.) 

Let  9  ■.  Ob ( SET )  -  Ar ( SET )  be  such  that 

MX)  =  HqX  . 

'r(X)  =  0  is  also  permissible. 

Let  U  :  Ob ( SET )  Ob(SET)  such  that 

1  ( X)  £  U(X)  £  »(X)  ,  where  }  (X)  =  class  of  all  finite  subsets  of 

X  . 

Consider  then  the  following  extension  relative  to  9  and  U  : 
for  f  :  X  -»  Y  ; 


7  :  U(X)  U(Y)  . 

Suppose  3C  :  3"  ( X )  -♦  9  ( U  ( X )  )  is  given  such  that  for  any  x  e  X  , 
defining  €  7(X)  ,  a  dirac-like  form,  where 
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ax(y)  = 


1  Iff  y  =  x 
0  iff  y  *  x  , 


*(«x>< B)  = 


1  iff  x  e  B 


0  iff  x  <  B  , 


all  B  e  U(X)  . 

Suppose  also  first  f-1  :  Y  -*  U(X)  with  extension 

f_1  :  U ( Y )  U(X)  .  Then  F  :  ir(X)  -  jr(Y)  extends  f  :  X  -  Y  :  For 
any  g  e  sr  ( x )  , 

F(g)  =  *(g)°f-1  e  7 (Y)  . 

In  particular, 


t(Sx}  *{f(x)> 


all  x  «  X  . 


Tn  general,  although  F  extends  f  ,  it  need  not  extend  F  , 
(This  will  'oc  clarified  a  bit  later.) 

In  turn,  by  considering  f  1  :  U(Y)  -♦  U(X)  ,  then 

F  :  3"(X)  -*  9"  ( U(Y))  is  also  defined  which  extends  F  :  (X)  -*  g'(Y)  . 

Finally,  consider  f_  :  !T(U(X))  -♦  sr  ( U ( Y )  )  as  an  extension  of  F 

or  f  ,  where  for  any  g  e  tr  ( U ( X ) )  ,  £(g)  may  be  define  in  two 

different  natural  ways: 

(I)  Now  let  *  :  !T(U(X))  -*  9-{U(X))  with  similar 
restrictions  as  before.  Then 

f (g)  =  K(g)°f-1  . 

(II)  Simply  let  x(g)  s  g  and  thus 

f(g)  -  g°f-1  , 

noting  now  that  for  g  =  «x  ,  assuming  e  tr(U(X))  , 


a  (B)  = 
x  ' 


1 ,  iff  x  e  B 
0  ,  iff  x  *  B  , 


for  all  x  e  X  , 


f<ax)  =  axof  =  *c 


if  (x)  ) 


=  (C  I  C  6  U ( Y )  &  f(x)  €  C)  , 


(f  (X)  } 
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the  filter  class  in  U(Y)  on  f(x)  . 

As  was  shown  above,  the  function  *  plays  a  key  role  in  de¬ 
termining  expansions  ?  or  ^f  of  f  .  One  basic  procedure  for 

determining  the  choice  of  5C  is  as  follows: 

(i)  First  consider  f  as  an  extension  of  f  :  X  -♦  Y  . 

Let  A  e  ff(X)  with  <fA  €  g-(X)  and  let  y  «  Y  . 

First,  formally  treat  A  as  an  ordinary  subset  of  X 
y  «  Y  ,  We  have 


?(fA)(y)  =  $£(A)(Y)  =  11  <y  e  ?(A)>H  =  II  <y  e  ?(A)>ll 

=  H  ( <or  (<xeA>  &  <x«f-1{y)>li 
X€  X 


=  H  (  (x,x)xeX  e  t  Axf  1  ( y )  )  ii 
X€  X 


((X'X)xex)) 


^(Axf'^y,) 

=  f  (<PA  *  *  _i  ^x‘xKeX\) 

X€X  f  (y)  X) 

=  f  _a  (4 >.)((*)  )  , 

(xef  "*” ( y )  )  A  xef  1(y) 

since  rng(<|>  )  s  {0,1}  . 

t  (Y) 

If  the  logical  system  (C,x,t)  is  also  truth  functional 

f  -rnm  =  t  \  /. _  _ i  _ 


derived  from  *  =  (4>not .  V  <l>or)  (we  write  also,  e.g.,  $ 

^nt  ^or  lln^M  )  <  then  the  above  evaluation  becomes  simply 


not 


or 


"<y  €  t(A)>n  =  tor  (4>a(x))  . 
xef  1  ( y ) 

noting  again  the  relations, 


f 

( j=l . 

4^  A  (  X  -  ,  ,  .  , 
n)  Aj 

.,xn) 

=  <*!>'■ 

•••■♦»  <V> 

n 

X 

( J-l . 

*A  (x  ,  .  .  , 
n)  Aj  1 

.,xn) 

■•'^A  (xnn 
n 

for  all  Aj  e  s(Xj) 

•  XJ  *  *3  ■ 

j  = 

1 ,  .  .  .  ,  n  . 

Thus,  in  the  case,  for  all  $  e  7(X)  and 

*<V(B)  =  1>or(4>A(x))  , 

xeB 


B  €  U(X) 
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noting,  as  a  check,  the  boundary  conditions 

x  e  B  e  U(X) 
x  t  B  e  u(X)  . 


*<*<x>>'B>  - 


1 

0 


iff 

iff 


Thus,  because  $  bijectively  relates 
be  considered  dually  as  an  extension  of  f 


s*  ( X )  and  sf(X),  t 


may 


t  :  S(X)  -  *(Y) 
as  developed  above,  and 

f  :  y(X)  sr(Y)  . 

(ii)  Next,  consider  f_  as  an  extension  of  f  :  X  -»  Y  .  Again, 
treat  a  e  ?(U(X))  formally  as  an  ordinary  subset  of  U(X)  with 
C  e  U(X)  arbitrary. 


Approach  ( I ) : 

=  II  <C  €  f  (jl)>ll 
=  II  <C  €  ?(JI)>II 

-  T  _i  (<PaX(B)  )  . 

(Be?  1 (C) )  Bef  1(C) 

Subcase  1 :  Logical  system  (C,*,t)  is  truth  functional.  Then 
it  follows  immediately  that 

_t(V<C)  =  <for  <VB>>  . 

( Be?- 1 ( C ) ) 

analogous  to  the  truth  functional  case  for  evaluating  £  . 

The  following  refers  to  section  2.3.9  (B) : 

Subcase  2:  Logical  system  ( C  ,  x  ,  t )  is  Probability  Logic  (PL), 
which  although  extended  from  a  truth  functional  system 
7  =  ($  ,  <}>&,  $  )  (at  the  left-ray  level)  is  itself  not  truth 

functional.  For  PL,  denote  fj  -  <j>^  ;  a  *  <f>_1(^) 

(inverse  of  <p)  and  where  for  any  q  >  1  , 

7 (X)  =  U(X)  =  0  ,  if  X  *  U( • • (U(Rq) ) • • ) 

U(Rq)  =  ,  3-(Rq)  =  DPL5  g,  3-  { D  )  =  PL?  q  ;  q  2;  1  . 
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q  r 

Assume  u  €  and  C  €  E  ,  with  f  -»  R 

g  r 

It  then  follows  that 

JiA'HC)  =  t  {fj )  (  ( B )  ) 

Be?  ( C )  Bel  (C) 

=  (1  -  x  ,  (1  -  H))  (  *  ,  ) 

Be?  A(C)  Be?  ^ ( C ) 

-  I  <-l)Card  *+1(  X  „)(  x  B!1 

0*$£f  ( C )  Bea  Bea 

=  x_  (p)  (  r _  B)  , 

Bef  A(C)  Bef  1(C) 

noting  the  complicated  form  of  sc(p)  here  . 

Approach  (tt<: 


«_(A»)  CC)  =  »  <f  1  (  C  )  e  $  1  (a/  )  >11 


P(f  1 (C) )  , 


(same  as  the  induced  probability  measure  /jo  f  ). 

Returning  now  to  t  ,  relative  to  PL  ,  where  A  e  is 

such  that  <}>  e  d  ,  and  C  =  ray  (Y)  ,  for  any  Y  e  Rr  , 
a  cj  r 

-  1  -j  (-l)°ard  G+1(  x  fA)l(x)  ) 

0*G£f  1(C)  xeG  A  X 


=  Z 


,  . card  G+1a  ... 

(_1)  ‘I1*  (<l>A(x))  • 

0* G£  f  1 {C)  „ 

xeG 


Furthermore,  if  4>&  and  4>or  are  semi-distributive,  it 

follows  that  the  last  expression  (formally  the  same  as  the  Poincare 
alternating  sign  expansion  of  the  probability  of  a  union  of  sets) 
becomes  simply 


?<V(c)  =  4>or  (<t>AU))  . 

xef-1(C) 

the  same  formally  as  for  a  purely  truth  functional  system!  However, 
in  general,  even  if  and  $  are  modular  or  Frankian,  this 

does  not  at  all  imply  semi-distributivity .  (The  latter  class  is 
much  smaller  than  the  modular  class  of 
copulas . ) 

Consider  now  the  "extension  principle",  that  is,  the  general 
fuzzy  set  extension  of  ordinary  functional  transforms.  Thus,  if 
f  X  -•  Y  is  an  ordinary  function  and  C  is  an  ordinary  subset  of 
X  , 


7(C) 


{  f (X)  |  X  €  C }  £  Y  . 
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More  generally,  let  A  be  any  fuzzy  subset  of  X  and  y  e  y 
arbitrary,  then  using  the  previous  mechanism  in  evaluating  predi¬ 
cates 


*f(A)(y) 


II  <y  e  f  (A)  >11 

ll  ( 3  x )  ( x  €  A  &  y  =  f(x))n 

il  (or  )  (x  e  An  f-1  ( y )  ll 
xeX 


*or  Wx)'  *.-i.  . 
xeX  r  m 


(x)  ) 


4>  . 

xef  X(y) 


Note  that  ^(a)  has  same  f°rm  as  the  transformed  proba¬ 
bility  function  for  random  variable  r(V)  ,  when  V  has  probability 
function  $A  ,  for  fuzzy  set  system  5’  =  (1  -  (•).  prod,  bndsum) 

(where  bndsum  =  sum,  because  of  the  probability  functions  present). 

The  above  derivation  should  also  serve  as  a  partial  response  to 
Manes'  criticism  [170]  concerning  the  "arbitrariness  of  the  fuzzy 
set  (function)  extension  principle".  (In  addition,  using  random  set 
representations  of  fuzzy  set  theory,  another  justification  for  this 
definition  may  also  be  made.  See  section  5.2.) 

Other  fuzzy  set  operations,  including  various  linguistic  quan¬ 
tifiers,  are  also  definable,  as  illustrated  later  in  section  2.6 
through  examples  of  fuzzy  set  symbolization  of  natural  language 
expressions . 

We  establish  now  some  results  needed  for  considering  lifting 
operators . 


Theorem  I  (See  [181'],  pp.  248,  249  for  analogous  results.) 

Let  f  -«  iRr  be  Borel  measurable  such  that  if  C  £  is 

any  closed  set,  then  f(C)  e  e  .  Then: 

r 

(1)  For  all  Be  E^  ,  f(B)  e  Br  ,  up  to  Lebesgue  r-measure  0 
iff 

(2)  f  possesses  property  (N)  ,  i.e.,  for  all  A  £  Rq  with 
volq{A)  =  0  ,  volr ( f ( A) )  =  0  . 


Proof : 

If  (2)  holds:  Let  B  e  be  arbitrary.  Then  for  all  nil  ,  3  Bn 

£  r'5  closed  such  that 

B  £  B  and  vol  (B)  -  —  <  vol  (B  )  S  vol  (B)  . 
n  q  n  q  n  q 
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j  1 — 

Define  B  =  u  B  Then  B  =  B  0  <B  -t  B  )  with 

00  n  00  00 

n=l 

vol  (B)  =  vol  (B  )  and  vol  (B)  =  vol  (B-<B  )  =  0  .  It  readily 

q  q  oo  q  q  « 

follows  that 

f(B)  =  f(BJ  U  f(B  ^  B J 
+a> 

with  f ( )  =  U  f(Bn)  €  Br  and  vol  (B  H  B^ )  =0  , 
n=l  ^ 

noting  indeed 

+00 

vol  (f(B))  =  volr(  u  f { B  ) )  . 

n=l 

If  (1)  holds:  By  the  proof  in  Natanson  [181']  (originally  shown  for 
f  continuous,  but  not  dependent  on  that  property),  if  f  did  not 

have  the  (N)  property,  there  exists,  A  £  with  vol^(A)  =  0  , 

but  vol  ( f  ( N )  )  >  0  .  Hence,  there  is  a  non-measurable  set  B  c 
r  — 

f ( A )  .  Hence,  An  f  1 ( B )  £  A  and  thus  vol  (Anf-1(B))  =  0  with 

q 

f(Anf-1(B)}  =  B  .  Thus  (1)  is  violated. 

Example:  If  f  as  above  is  absolutely  continuous,  then  (1)  and  (2) 

both  hold. 

Recall  for  any  set  X  , 

} ( X )  £  U(X)  £  *(X)  . 

Theorea  II 

Let  f  :  X  -♦  Y  be  such  that  for  a  «  U(X)  and  a  e  U(Y)  , 
f ~1  (a )  ^  (f_1(B)  |  B  e  a)  £  a 

and 

1(A)  3  { f ( A )  I  A  €  A  }  £  a  . 

Then 

f_1(?(A))  £  f-1 (a )  £  a  . 

Theorem  III 

Let  f  :  X  -»  Y  ,  inducing  1  :  a(X)  -►  a(Y)  ,  and  let 


134 


Goodman  and  Nguyen 


A  €  U(X) 

Then 

(1)  f"1 

(f(A) )  £  a  ,  i.e. ,  f 

-1 ( f ( A) )  e  a  ,  for 

all 

A 

e  a  , 

iff  for 

all  C  e  9  ( Y)  , 

(2)  C  n 

rng(f)  e  ?(A)  iff 

f_1(C)  €  A  . 

( Proof : 

If 

(1)  holds: 

If  CD  rng( f )  e 

7(a)  ,  then  C  n 

rng ( f ) ) 

=  f(A) 

some  A 

e  a  .  Thus ,  f  1 ( C ) 

=  f-1(C  n  rng ( f ) ) 

€  * 

by 

(1)  • 

If 

f-1(C)  e  a  ,  then 

f ( f_1 (C)  )  =  f(f" 

^(C) 

n 

rng ( f ) ) 

ring( f ) 

.  Hence  C  n  rng(f) 

e  f(^)  . 

If 

(2)  holds: 

Let 

A  «  a  be  arbitrary  and  C  =  f(A)  . 

Since  clearly 

C  n  rng(f)  =  f(A)  e  7(A)  ,  It  follows  by  (2)  that  f  1(f(A))  e  a  . 
Remark . 

For  any  A  «  A  ,  f  1 ( f ( A ) ) 

set  B  in  *(X)  with  7(B)  =  A  . 

Corollary  1 

Let  f  :  -♦  Rr  be  Borel-measurable  (i.e.,  f  1(B  )  £  B  ) 

r  q 

such  that  for  all  B  e  Bq  ,  if  B  is  closed,  then  f(B)  €  Br  ,  or, 
if  volq(B)  =  0  ,  then  volr(f(B))  =  0  .  Then 

(i)  f“1(?(Bg))  £  Bg  , 

and 

(ii)  For  all  C  «  ?(Y)  ,  with  a  as  before 

C  n  rng(f)  e  7(a)  iff  f1(C)€A. 

In  particular,  if  f  is  absolutely  continuous,  then  (i)  and 
(ii)  hold. 


=  U  B  =  U  B  =  largest 

Be7_1(f(A))  f Be» ( X)  } 

17(B)  *  Aj 
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Remark  1 . 

For  any  C  «  »(Y)  , 

C  n  rng ( f )  e  “  (  a  ) 

iff  ( 3  A) (A  €  a  &  7(A)  =  C  n  rng ( f ) ) 
iff  ( 3 A) (A  €  a  n  7-1(C) )  . 

Remark  2. 

Note  that,  always,  for  any  C  «  »(Y)  ,  if  f-1(C)  e  a  ,  then 
C  n  rng(f)  e  7(a)  . 

Thus,  when  considering  functional  extensions,  in  general,  two 
choices  can  arise;  the  lifting  operator  tt  can  be  defined  as  fol¬ 
lows:  for  f  :  X  -*  Y  ,  tt(f)  :  f(X)  -♦  T  ( Y)  ,  where  for  A  e  ?(X)  , 
and  equivalently  $A  «  tr  ( X )  ,  and  y  ;«  Y  ,  by  treating  formally  A 

as  an  ordinary  subset  of  X  , 

(TT(f )  (<fA) )  (Y) 


II  <f  1  { y )  c  A> II 
or 

.  it  <y  e  ?(A)  >n  . 


The  first  interpretation  -  see  Corollary  1  -  is  relatively 

simple:  it  is  the  same  as  <|>A(f  1  ( y )  )  ,  a  desirable  form.  However, 

this  interpretation  is  only  natural  when  X  s  »(X')  and  Y  s  y(Y') 
for  some  spaces  X'  and  Y'  ,  and  where  we  interpret  for  any  y  e  Y  , 
by  abuse  of  notation 


f_1 (y)  =  u  f  1(z)  =  Uf  1 (y)  . 
zey 

In  particular,  this  holds  for  X  =  B  ,  Y  =  B 

q  r 

X1  =  Rq  ,  Y'  =  Rr  ,  with  (Rq,  Eg,  fj)  any  probability  space, 

A  €  ?(Bq),  €  3-(Bq)  =  »rq,  g  :  Rq  -  Rr,  f  ^  g  :  Bq  Br  , 

and  thus, 

it  <f_1  (y )  €  A  >  it  ---  <t>A(f_1(y))  =  M°f_1(y)  . 
noting  (Rr,  Br,  1) 


is  the  induced  probability  space  under  f 
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and  hence 

Tt(f)  :  »rg  -  *rr  . 

On  the  other  hand,  the  second  Interpretation  though  relatively 
complicated  -  see  Remark  1,  following  Corollary  1  -  can  always  be 
used,  where  here  simply 

f_1(y)  =  {x  |  x  e  X  &  f<x)  =  y}  . 


Thus , 


li  <y  «  £(A)>ll  =  II  <  ( 3  x )  (x  e  A  n  f-1(y))>ll 


=  il  (  or  <  xe  An  f  1(y)>)n 
xeX 


=-l, 


=  II  ( (X)  6  t  (A  n  f  ( y ) ) ii 

xeX 


A(Anf_1 (y) ( (x)xeX} 
X€X 

=  x^x(<f,Anf_1(y)  ]  (  (x)xeX) 


f  -1  (^M^x)  ^  1(V) 

xef  1(y)  A  X€r 


tA-l 

xel  1 (y) 


( (x)x€f_1(y) ]  ' 


using  the  fact  that  f  ('  is  an  ordinary  set  and  using  the 
properties  of  t  , 

Thus,  e.g.,  for  any  truth  functional  system 
7  =  (<,)not'  $&'  ♦or1  -  f  :  X  -  Y  ,  and  any  A  e  ?  (X)  ,  y  €  y  , 

<J>  (y)  =  <f  (y)  =  <|>  r  (♦.(*))  . 

(n(?)(A)  ?(A)  °£-T  A 

x«f  1  (y) 

Also,  note  that  Corollary  1  gives  sufficient  conditions  when 
the  two  interpretations  for  n  coincide. 

Again,  note  that  the  two  approaches  are  based  upon,  in  general, 
two  different  appearing  relations 


.-1, 


^Rel  (y'A,f)  =  11  <  ( ^  x )  ( x  €  A  n  f  (y)>n 


and 
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^Rel2(Y,A'f)  =  "<f  J<y)  e  A>"  ' 

for  all  f  :  X  -*  Y  ,  A  €  *(X)  ,  y  €  Y  . 

When  the  sufficient  conditions  (Corollary  1)  hold  for  f  ,  say, 
all  f  e  C(X,Y)  ,  then  for  all  y  €  y  ,  a  <=  *(X)  , 

wi(y*A'F)  =  ^Rei2(y'A'f)  • 

Equivalently, 

(vy) (VA) ( If  y  e  Y  and  A  e  *{X)  and  f  e  C(X,Y)  ,  then 
(Relj (y,A)  iff  Rel2(y,A)))  . 

Note  that  for  j  =  1,  2, 

Rel ^  £  Y  x  *(X)  x  YX 

and  may  be  considered  equivalently 

fRei  :  V  x  »(X)  x  YX  -  {0,1} 

or  as  the  function 

fj  :  Y  -  »(»(x)  x  YX)  , 

etc . 

Thus,  if  a  standardized  extension  procedure  is  adapted  for  all 
functions,  e.g.,  then  clearly  since  Rel2  =  Rel2  over 

Y  x  »(X)  x  C(X, Y)  , 

tttRelj)  =  rt(Rel2)  :  V  {Y  x  »(X)  x  C(X,Y)  )  -►  !T{0,1}  . 

However,  in  general,  even  if  such  a  consistent  procedure  were 
adapted,  a  given  relation  could  be  reinterpreted  distinctly. 

For  example,  Rel^  could  be  interpreted  as 

1>Rel  :  X  x  »(X)  -  {0,1} 

where 


or  Rel2 

^Rel2  :  X  ^  ’ 


to-i  U,A)  -  <p  — T  (X)  , 

Anf  1  (y) 


as 
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where 


>Rel2(A>  ' 


etc . 

Thus,  much  care  must  be  exercised  in  the  choice  of  definitions 
and  in  particular  the  application  to  the  extension  of  functions. 
(See  Manes'  related  approach  to  extensions  and  lifting,  section 
7.2.  ) 


Extensions  of  classical  logical  connectors  and  semantic 
evaluations  from  classical  logic  to  HQ-valued  logic. 

1.  All  statements  in  natural  language  must  be  put  into  formal  lan¬ 
guage  in  terms  of  memberships  w.r.t.  attributes  and  use  of  the 
operations  not,  &,  or,  in  perhaps  some  compound  form. 

2.  Consequently,  all  propositions  of  interest  in  the  formal 
language  may  be  reduced  to  strings  of  the  generic  form 

s  =  (comb (not,  &,  or))(<x1  «  Aj ),..., <x  €  An>)  ' 

3.  Each  such  string  s  may  be  further  interpreted  as 

s  =  <(Xj . xn)  e  (comb(C,x,t )) (Ax, . . . ,An) ) 

^  V - - - - - * 

4.  The  truth  extension  of  any  such  s  is: 

usii  =  *}>B(x) 

■  ( (comb(C  ,x,t  )  )  {<(>  ))(x) 

A1  An 

(However,  this  does  not  necessarily  imply  truth  functionality.) 

5.  If  the  system  is  truth  functional,  then  step  4  may  be  carried 
further: 


usii  =  c°mb($not,  $or  5  ^A  {xl 5  '  •  ’  ■  '$A  (xn}  ) 

1  n 

where  the  operators  $  .  «  ®  e  extend  their 

not  1  &  or  2 

classical  counterparts  $not  €  a*  1 $or  €  '  *n  some  sense, 

where 

=  (f  |  f  :  {-,l)n  -  {0,1}) 

is  the  class  of  all  n-argument  Boolean  polynomials  (or  truth  table 
functions)  for  classical  0-1  logic  and  where  we  could  choose, 
e.g.,  Hq  =  [0,1]  ,  and 
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»»n  3  {f  |  f  :  Hj  -  H0)  ,  n  ■  1,2 . 

is  the  class  of  all  generalized  n-argument  Boolean  polynomials  (or 
truth  table  functions)  and  where 

$A  (Xj)  =  ii  <xj  €  A j >n  ,  j  =  1 ,  .  .  .  ,n  . 

6.  In  place  of  step  5,  for  the  non-truth  functional  system  LSL 
( Lebesgue-Stielt jes  Logic),  iisil  must  in  general  remain  as  in  step  4 
and  be  evaluated  by  relations  established  between  C,  *,  ♦  . 

However,  for  the  subsystem  PL  (Probability  Logic),  we  do  have 
the  relation,  where  each  x-  e  B  , 

J  qj 

<t) a  €  »  j  =  1,  •  •  •  ,n  , 

1  qj 

Hsll  =  <$A  x.  •  •  x<pA  )  (comb(C  ,x,t  ) )  (Xj  ,  .  .  .  ,xn)  . 

1  n 

(Note  that  each  x^  here  is  a  set,  not  a  "point".) 

Consider  next  step  2.  It  is  well  known  (see,  e.g.,  Enderton 

[54])  that  in  classical  0-1  logic,  (d>  .  ,  <t>_  ,  *  )  form  a  com- 

not  «  or 

plete  Boolean  operator  system,  i.e., 
d  ® 

ag>  =  u  a?n  =  {comb($not ,  <|>&,  4>or)  I  comb  is  an  arbitrary 
n=l 

combination  forming  strings). 

The  question  of  operator  completeness  for  any  [0,1] -valued  logic 
becomes  essentially  a  moot  one.  (See  [54],  pp.  62-66,  313-315.)  In 
any  case,  the  above  property  justifies  step  2  for  formal  logical 
abstractions  of  two-valued  logic  to  more  general  ( [0 , 1 1 -valued ,  for 
example)  logics. 

However,  consider  the  following  alternative  evaluations  for 
iis»  in  steps  4  and  5: 

We  can  extend  all  at  once  to 

=  { f  |  f  e  ) 

by  use  of  either  definition  for  {*)  or  n  ,  as  discussed  earlier. 
In  particular,  this  means  that  for  a  truth  functional  system,  the 

0-1  logical  connectors  <f>  .  «  , ,  A  *  ,  d>  ,  *  e  »g>  _  all 

not  1  T  &  T  or  T=»  T«=»  2 

extend  ( non-uniquely)  to  some  $  .  ,  ,  $  ,  $  ,  $  e  .  in 

not  «  or  »  «=» 

turn,  these  extensions  may  be  used  in  steps  3,  4,  5  for  modeling 
and  evaluating  sentences  or  strings  s  .  (The  triple  of  primitive 
operators  (not,  &,  or)  could  be  replaced  by  (not,  &,  or,  =»  )  , 

etc..  In  generating  sentences  through  all  choices  of  comb.)  Hence, 
any  s  in  step  5  is  evaluated  by  this  approach  as 
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[0,1]  =  5 ( {0,1})  , 

where  for  any  x  e  [0,1] 2n  ,  $Q[x)  e  [0,1]  can  be  considered  the 

degree  of  truth  of  £  operating  on  x  ,  while  f^(x)  «  [0,1]  can 

be  considered  the  degree  of  falsehood  of  f'(x)  .  Manes  ([169]  and 

see  also  section  7.2)  considers  this  approach  to  the  extension  of 
operators  as  well  as  to  the  extension  of  membership  and  equality 
relations.  In  general,  we  take  the  first  approach  for  simplicity, 
with  the  other  in  mind.  The  resolution  of  choice  of  the  most 
appropriate  extension  of  classical  logic  operators  to  HQ-valued 

logic  remains  a  difficult  problem.  Some  progress  at  the  practical 
level  may  be  seen  in  the  work  of  Zimmermann  [291]  and  others  [250], 
[275].  See  also  section  8.4  and  Chapter  9  for  related  issues  where 
asymptotic  behavior  may  be  utilized.  Frank's  or  Yager's  families 
can  serve  as  natural  parametrized  classes  of  connectives  -  see 
section  2.3.6  again. 


2.3.8  Further  semantic  evaluations 

In  sections  2.3.5  and  2.3.6,  we  have  considered  some  basic 
formal  language  operations,  and  their  direct  general  set  operation 
interpretations,  as  well  as,  several  possible  corresponding  semantic 
evaluations,  depending  of  course  on  the  particular  logic  chosen. 
These  included: 

nt  (negation)  -  complement  -!-(•)  ,  etc. 
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&  (conjunction) 
etc . 

or  (disjunction) 
»  (implication) 


intersection  or  more  generally,-  t-norms, 
cartesian  product. 

union,  or  more  generally  -t-conorms,  etc. 
cartesian  sum 

if  (•)  then  (••)  -  nt(*)  or  (••)  , 
etc . 


<=*  (double  implication)  -  iff  -  (if{*)  then  (••))  and 

(if (- *  )  then  (• )))  • 


In  addition,  in  section  2.3.7,  we  treated  the  important  con¬ 
cepts  of  the  formlization  of  ordinary  functions  (within  a  formal 
language)  and  the  consequent  semantic  evaluations  as  extensions  of 
these  functions. 

In  this  section,  we  look  at  some  additional  formal  language 
concepts  together  with  their  corresponding  semantic  evaluations. 
All  of  these  are  related  to  functional  extensions  and,  more  gen¬ 
erally,  fuzzifications  of  classical  concepts. 

It  should  be  noted  that  the  term  "projection"  as  used  in  sub¬ 
section  (D)  should  not  be  confused  with  the  function  symbol 
proj  :  0b(/.)  x  Ob (l)  -*  Ar (l)  and  its  semantic  evaluations.  In 

addition,  only  two  quantifier  function  symbols  -  together  with  the 


axioms  and  rules  in  which  they  appeared  in 


Th 


synK ,  K 


and  their 


semantic  evaluations  -  were  considered  in  some  depth  in  sections 
2.2.2  and  2.3.2,  namely:  v  and  3  .  Even  so,  the  axioms  and  rules 
governing  these  are  compatible  with  essentially  only  intuitionistic 
logic  and  in  particular  not  with  general  negations,  t-norms, 
t-conorms,  i.e.,  general  fuzzy  set  systems.  Furthermore,  other 
quantifications  -  such  as  Mo  and  Abt(3/4)  were  listed  as  function 


symbols  in  2.2.2,  representing,  "most",  "about 
but  no  axioms  and  rules  were  introduced  in 


3/4",  respectively, 
ThsynK,K  governing 


their  behavior.  Consequently,  because  of  the  nature  of  general 

quantification,  an  alternative  approach  -  basically,  only  through 

semantic  evaluations  and  not  through  any  particular  theory  -  is 

developed,  as  given  briefly  in  subsection  (F)  here,  although,  of 

course,  the  function  symbols  for  such  quantifications  do  appear  in 

the  formal  language.  Note  also  that  the  extreme  quantifiers,  3 

and  v  ,  can  be  considered  as  eliminative  definitions  in  a  natural 

sense  in  terms  of  interated  or  and  &  operators,  respectively. 

However,  as  before,  this  approach  and  the  axiomatic  one  developed  in 

Th_,„,  v  in  general  do  not  coincide.  Nor  do  these  coincide  with 
syn^  #  i*w 

the  single  "percentage"  approach  (II)  used  here  in  subsection  (F) . 


A.  Compound  operators 

"if  (•)  then  (•■)  "  can  be  defined  compoundly: 

( •  )  =>  (••)  as  (not(-)  or  (••))  ,  or  can  be  defined  directly, 
"iff"  «=>  usually  defined  as 
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( if ( •  )  then  {-•))&  if  (••)  then  (•))  »  i.e., 
(•»••)&(**»*)• 

The  concepts  of  incidence  and  subset  have  already  been 
approached  through  majorizing  membership  functions.  Another 

(different)  approach  is  through  the  compound  formalizations: 

(  A  A  B  )  =  or  (x  €  A  n  B)  , 

X€X 

(  A  £  B  )  =  &  ( (X  «  A)  =»  (X  e  B) )  . 

xeX 

This  naturally  leads  to 

(A  =  B)  =  (A£B&B£A)  . 

But  note  that  this  does  not  in  general  imply  the  following  semantic 
evaluations  as  $A  =  ,  i.e., 

II x  €  All  *  II x  e  BH  ;  for  all  x  e  X  . 

Indeed,  under  a  truth  functional  system, 

HA  =  Bn  =  k(x).  <t>B(x))  - 

xeX 

<t>&  (^(^(X),  <f>A(X))))  . 

xeX 

Other  equivalence  relations  similar  to  equality  of  sets  have 
been  discussed  in  section  2.3.3  on  GOdelian  Logic  and  in  section 
2.2.2  (see  "*")  ,  as  well  as  in  Appendix  3  (iv),  section  2.4.2.. 


B.  Intensifiers 

For  any  A  e  ?(X)  and  non-negative  integer  n  ,  define  A®  , 
the  n1"*1  intensification  of  A  as,  for  all  x  e  X  , 

(x  €  A  ®  )  =  &  (x  e  A.)  , 

j=l . n  J 


where 


A1  =  A2  =  • •  “  An  “  A 

and  define  the  n*"*1  extens  i  f  i  cat  i  on  or  stretching  of  A  as  ,  for 
all  x  €  X  , 

(x€A®)  =  or  (x€A.) 

j=l , . . . ,n  J 

where  again 


A  . 
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Thus,  if  a  truth  functional  system  is  present,  typical  evalu¬ 
ations  are 


iix  e  A©  ii  =  (4>a  (x)) 

j=l ,  .  .  .n 

^Ax- • -xA(x' • * - ,x)  ' 
nx  e  A®  li  =  <j>Qr  (<|>A  (Xj)) 

3=1 _ n  j 

^At  •  •  •  t A(x' - • * ,xn}  ' 

If  further,  is  a  strict  Archimedean  t-norm  with  a 

canonical  representation  as  given  in  section  2.3.6; 

<f>&(u,v)  =  h-1(h(u)  +  h(v)  ) 

then  it  readily  follows  that 

ii  x  e  a®ii  =  h-1  (n-h($A(x) ) ) 

with  a  dual  representive  for  lix  «  A  ®  ii  ,  when  *  is  a  strict 

or 

Archimedean  t-conorm.  In  turn,  these  relations  may  be  used  to 
define  immediately,  arbitrary  real-intensifications: 

For  any  non-negative  real  r  , 

ii x  e  A©  ii  *  h-1(r- h(<$>A(x)  )  )  . 

Note  the  basic  exponential  relation,  for  any 
r ,s  2  0  ,  x  e  X,  A  e  5 (X)  , 

li  x  e  (A©)©  il  =  it  x  e  (A  ©  )  ©  II 
=  lix  «  A  v-7  II 
*  h_1(rs-  h(4>A(x)  )  )  . 

Thus,  linguistic  hedges  such  as  "more  or  less",  "very", 
"extremely",  "little",  etc.,  could  all  be  defined  through  proper 
choice  of  exponent  above.  (But  see  also  [51]  and  [167]  for  other 
approaches . ) 


C.  Conditioning  (See  also  section  2.3.9) 

Let  A  e  5 (XxY)  and  B  e  ?(X)  ,  noting  that  trivially 
B  e  ?(XxY)  where  for  all  x  e  X  ,  y  e  Y  , 

$B(x,y)  =  $B(x)  • 


Suppose  also  that  (in  the  membership  function  ordering  sense) 
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A  £  B  , 

i  .e.  , 

sup  $A(x,y)  i  <J>B(x)  ;  all  x  e  X  . 
ye  Y 

Suppose  finally  that  <f>&  is  continuous.  Then  by  the  basic 

(Darboux)  property  of  continuous  functions,  if  follows  that  there 
exists  at  least  one 

( A | xe B )  e  ?(Y)  ;  all  x  e  X 

such  that 

II  ( ye  ( A | xe B )  )  &  (x  e  B)il  =  ll(x,y)  e  All  , 

i.e.,  assuming  truth  functionality, 

(Y)  .  <fR(x))  =  (x,y)  ;  all  x  e  X  . 

( A  |  xe  B  )  B  A 

We  call  ( A  |  xe  B )  the  conditional  generalized  set  A  given 

x  e  B  . 

The  above  relation  is  equivalent  to 
( A  |  xe  B )  X  B  =  A  . 

If  <J>&  is  strictly  increasing  in  its  arguments,  then  the  above 
conditional  generalized  set  is  uniquely  determined. 


D.  Prolectlons  and  interactions  of  generalized  sets 

Let  A  e  9(XxY)  .  Then  for  any  given  $  ,  the  X-projection 

of  A  is  given  by,  for  all  x  e  X 

Vojx(A)(x)  =  *or(Vx**,) 

X  ye  y 

=  ii  or  {  (x,y)  e  A)  n 
yeY 

=  'I  (3 yeY)  (  { x , y )  e  A):: 

assuming  a  truth  functional  system. 

Thus,  using  the  basic  property  of  t-conorms 

Vojx(A)(x)  *  VX'V>  *  VX'*>  ' 

for  all  x  e  x  ,  y  e  Y  ,  implying 

4>a(x'Y)  ~  ^&^(A|xeprojx(A)  )  '  ^projx(A)^X^  ' 


Then, 
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( A | xeprojx(A) )  =  projy(A) 
iff 

A  =  projy(A)  x  projx(A)  , 

in  which  case,  the  two  projections  of  A  are  said  to  be  non- 
interactive,  or,  in  particular,  independent,  for  =  prod. 

For  other  approaches  to  projections  and  conditioning  see  Nguyen 
[188]  and  section  3.3.  See  also  section  2.3.9  for  related  results 
for  Probability  Logic. 


E.  Bayes 1  theorem 

Let  X  and  Y  be  any  two  spaces  and  assume  for  each  x  e  x  , 
a  set  Bx  «  2(Y)  exists.  Also,  let  C  e  ?fx)  be  given.  Then  for 

any  chosen  <$>OT.  ,  ( A  |  ye  pro  jy  ( A)  )  e  ?(X)  exists  for  each  y  e  Y  such 

that  for  all  x  €  X  , 


(x,y)  =  (Y =  4>r.(<f> 


x 


&(A|y«projy(A) ) 


(x)  ,<}> 


projy(A) 


) 


(The  proof  is  immediate  from  the  definition  of  conditional  sets.) 

We  can  identify  without  loss  of  generality  (with  abuse  of 
notation) 


Bx  =  ( A | x  e  projx(A) ) 
C  =  projx(A) 
projy(A) 

(A|y  e  pro jy ( A ) ) 


conditional  set 
prior-X  set 
averaged-Y  set 
posterior-X  set 


However,  in  general,  equality  does  not  really  hold  for  the 

first  two  relations.  Indeed,  if  for  each  x  e  X  ,  there  is  a 

y  e  Y  such  that 
x 

*B  (V  =  1  ' 

X 


then 


*B(x)  5  Vojv(A)(x)  5  *or(Vx)> 
X  y«Y 


and  hence  for  a  non-Archimedean  t-norm  such  as  sup  ,  equality  indeed 
does  hold. 

The  use  of  Bayes’  Theorem  procedes  analogously  as  in  the  clas¬ 
sical  probability  situation,  which  incidently,  is  clearly  a  special 
case  of  the  above  development  when  discrete  spaces  are  considered 
and  $£  =  prod,  =  bndsum  and  are  ordinary  probabil- 

°K 

(See  Chapter  9  for  applications  to  combination  of 


ity  functions. 
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evidence  problems.) 


F .  Quantification 


Generally,  quantification  may  be  approached  as  follows:  (See 

also  Zadeh’s  extensive  work,  e.g.,  [283],  [286].) 

First  assume  as  usual  that  a  fixed  logic  and  a  logical  system 
are  present.  Let  "quant"  be  the  linguistic  form  for  any  percentage 
quantifier  such  as:  many,  some,  all,  almost  none,  few,  sometimes, 
about  3/4,  two-thirds,  most,  etc.  It  is  assumed  that 
^quant  :  [0>1]  -*  [0,1]  •  Note,  however,  some  dichotomy  for  the 

extreme  quantifications  "for  all,"  and  "there  exists":  These,  of 

course,  may  also  be  treated  from  an  interactive  conjunction  and 
disjunction  approach,  respectively. 

Let  "pop"  be  the  linguistic  label  for  some  fixed  ordinary  set 
representing  the  base  population  to  be  considered.  Let 
A,B  e  7 ( pop )  ,  representing  two  attributes.  We  wish  to  evaluate 

semantically  the  sentence 

s  =  quant (A  is  (are)  B)  . 


For  example: 

"most  men  are  short  here",  "several  tall  people  have  jaundice", 
"sometimes  fat  people  have  breathing  problems",  etc. 


Approach  I  -  Use  of  implication  operator. 

s  =  quant  ( Vx  e  pop)((x  e  A)  =»  (x  6  B))  . 

Assuming  the  logical  system  is  sufficiently  truth  functional, 
the  above  evaluation  becomes 

1,811  =  Vant{  *  <WX>  '  Vx),))  • 

n  xepop 


Approach  II  -  Use  of  conditioning. 


Suppose  formally  that  "univ"  stands  for  the  ordinary  set  repre¬ 
senting  all  possible  populations  or  "worlds"  of  interest.  Also,  let 
"wt"  be  a  weighting  attribute.  Then  we  may  also  formally  consider 
A , B  e  ?(univ)  and  the  wff's 

a  =  (pop  e  A) 

=  ( 3 x  e  pop)(x  e  A  &  x  e  wt)  , 
b  =  ( pop  e  A  n  b ) 

=  ( 3  x  €  pop)  (xe  ADB&xe  wt )  . 


Note  that  from  the  above  definitions,  it  does  not  follow  in 
general  that  truth  functionality  holds  for  A  n  B  relative  to  A 
and  B  ,  i . e  .  , 
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^ATlB^p0p)  *  ^&^A(P0P)  '  iVPOP,)  • 

However,  truth  functionality  may  be  applied  to  x  e  A  n  B 


*AnB(x)  =  Wx)  '  Vx)) 

may  hold. 

Thus,  typically, 

"a"  =  ^or  ( Wx)  '  V(x)  5  > 

xepop 

"b"  =  ^or  (V*A{x)  '  Vx)  '  (WX>>> 

xepop 

and  hence,  il(b|a)n  is  evaluated  from 
ii bn  =  <{>&(ii  (b|a)n  ,  ii an  )  . 

In  turn, 


ll  (quant(b|a)  )il  =  t>quant  (  »  (b  |  a)  II  )  . 

In  particular,  if  <Pwt(x)  a  l/card(pop)  ,  <J>&  =  prod  (or  min),  and 
1>or  =  bndsum  ,  then 


ii  (b|a)n  = 


xepop 


<?>a(x)  •  $B(x)  / 


Z  $.{x)  s  card(ATiB)  /card  A 

X€  pop  * 


™ltb  11  (quant(b|a)  )ii  then  obtainable  -  the  same  result  as  obtained 
quantifications  2831 '  [286]  in  hls  "fuz2Y  cardinality"  approach  to 

Again,  as  in  the  general  case,  modeling  of  quantifiers  mav  be 
carried  out  in  several  different  ways.  y 

See  section  2.3.9  (B)  for  more  detailed  development  of  condi- 
oning,  projections,  Bayes1  Theorem,  and  quantification  as  applied 
to  probability  and  Lebesgue-Stielt jes  logical  systems. 


G  • 


Transformations  of  dispersions,  induced  measures  and 
linguistic  variables.  ‘  ~ 


3  fi3persion  space  (X,*,p)  consists  of  a  set  X  , 
J(X)  s  A  s  *(X)  ,  with  A  possibly  having  additional  structure  such 

add  it6/™  i3  °_3l9ebra'  and  Hi*-*  [0,1]  ,  also  possibly  having 

ditional  structure  such  as  being  a  probability  measure. 

Let  (X,A,fj)  be  an  initial  dispersion  space  (Y,»)  a  space 
such  that  J(Y)  s  *  c.  *{Y)  ,  for  some  set  y  ,  with  f  x  -f  ? 

being  a  -  %  measurable,  i.e.. 


,-l , 


f""(a)  -  (f_1(B)  |  B  e  a)  s  { f "" 1  ( A )  |  A  €  a)  , 

where  as  usual  f_1  :  ( y )  ?  (X)  ,  where 
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f-1(A)  =  {x  I  x  €  X  &  f(x)  e  A)  . 

Then  f  Induces  a  (final)  dispersion  space  (Y,»,w)  ,  where 


Here,  (X,.A,p)  is  a  probability  space  and  hence  (Y,»,y)  is 
also  a  probability  space,  the  one  induced  by  f  ,  also  considered  a 
random  variable. 


Example  2 . 


listic  variables  (See  also  Zadeh  [279].) 


Let  X  be  a  measurement  space.  Typically,  X  £  Rm  ,  or 
X  =  {••  red,  yellow,  ••)  ,  etc.  Let  Y  be  a  space  representing  a 

population,  such  as  all  living  female  humans  between  30  and  40 
years  of  age  and  holding  a  Ph.D.  degree  in  Economics.  Let 
g  :  Y  -»  X  representing  a  measurement  mapping  such  as  age,  height, 

y 

race,  weight,  etc.  Indeed,  X  may  be  considered  the  class  of  all 

Y 

measurement  mappings.  Next,  ling  :  X  -►  ^(X)  is  the  linguistic 
variable  mapping  where,  for  example 


ling (age)  =  {••  old,  young,  very  old,  about  20  years  old, 
middle  age ,  . . .  } 

ling(ht)  =  tall,  very  tall,  short,  between  60  and  68  inches 

tall ,  . . . )  ,  etc . 

A  natural  constraint  is  that 

ling(XY)  =  u  ling(g)  , 
geXY 

V 

and  a  basic  condition  is  that  for  any  g  e  X  and  any 
C  «  ling(g)  £  ?(X)  ,  there  is  a  rc  €  ?(Y)  such  that 

<y  «  rc>  =  <g(y)  e  C>  ,  all  y  e  Y 

and  hence  taking  semantic  evaluations 

(Y)  =  l'  ( Ye r c ) M  =  ll(g(y)  e  C)n  =  <j>c(g(y))  , 

C 

for  all  y  e  Y  . 

Thus,  for  example,  for  y  =  John  e  Y  ,  g  =  age  and  C  =  young, 
rc  =  "young"  and 

VungtJohn)  =  Vung(a9e(John,)  * 
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y 

Now  consider  again  any  g  e  X  .  Then 

g-1  :  rng(g)  -*  rng(g-1)  ,  where  rng(g)  S  X  ,  rng(g-1)  s  *(Y)  , 

is  disjointly  bijective  (due  to  the  partitioning  of  Y  induced  by 

g-1}  ,  noting  { g~ 1 )  ~ 1  =  g  :  ^(rngfg-1))  -♦  a(rng(g))  .  (g  being 

the  set  extension  of  g  as  previously  developed  in  2.3.7.) 

In  this  case,  we  let  the  initial  dispersion  space  be 
(rng(g)  ,  a..  >j)  where 


.A  §  J  (rng(g)  ) 
d  * 

P  =  <I>C  ' 

with  C  e  lng(g)  arbitrary  fixed,  and  where  for  any  Q  e  a  , 

VQ)  =  <Vx)  >  ' 

xeQ 

for  t-norm  fixed. 

Then  we  may  take  for  the  transformation  here 
m  d  -1 

f  =  g 

resulting  in  induced  dispersion  space  (rng(g-1),  a,  u)  ,  where 


a  =  1 ( rng ( g  ) ) 
d 

y  =  4> 


where  for  any  Q  e  J(rng(g  ))  , 


4>  (0)  =  <}>r( g(Q) ) 

rc  c 


=  <|»&  (<j»c(g(Q))) 

Q€Q 

=  ( <PC  ( g  ( y J ) ) 

y«  QeQ 


2.3.9  Examples  of  seai-truth  functional  logical  systems 

In  this  section,  the  formal  language  £  is  for  Probability 
Logic.  (See  Maydole  [172]  ,  p.  145;  Hailperin  [102’];  Rescher 
[214],  pp .  184-188;  also  section  2.3.3.)  We  will  first  proceed  to 
specify  the  concept  of  dispersion  and  show  that  the  probability 
logical  system  is  semi-truth  functional  (i.e.,  up  to  the  rays-level) 
but  not  truth-functional.  Next,  we  will  extend  the  probability 
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logical  system  to  the  Lebesgue-Stielt jes  logical  system. 


A.  Probability  logical  svstea 


I.  As  in  2.3.5,  let  Y  =  R  ,  and  X  =  B1  ,  the  Borel  o-algebra 
of  R  ,  so  that  T(X)  =  7(R)  =  the  class  of  all  probability  measures 


over  ( R ,  B  j )  .  If  Pr  <=  ir(iR:)  ,  we  write  Apr  =  <t>_1(Pr)  e  ?(R] 


for 


the  generalized  set  on  with 


f>.  =  Pr  .  Pick  any 

Pr 


«-copula,  e.g.,  min,  prod.  Let  4>or  be  the  DeMorgan  transform  of 
• 

First,  construct  by  Sklar's  theorem  (see  2.3.6  or  [238])  all 

joint  probability  distribution  functions  F.  for  any  given 

$  •  •  •  f  n 

Pr,,...,Pr  «  sr  { R )  using  . 

in  6c 


F1 . n<al . an^  =  VFl(al> . Fn<an>> 


for  all  a j  e  r  where  F^  is  the  distribution  function  correspond¬ 
ing  to  Prj  ,  j  =  l,...,n  .  Thus,  for 


‘j 


'•  V  €  B1  ' 


Pr 


1 , . . .n'  a 


(Ia  x-..xla  )  =  ^(PT1(Ia  ) . Prn(Ia  )) 


n 


n '  a 


n 


(truth  functionally  over  rays). 

In  turn  this  uniquely  determines 


Note  here  x  is  defined  (through  <J>&)  by 


Pr,x • • • xPr  =  Pr, 

1  n  l  >  i  •  •  i  n 


We  extend  x  to  include  forms  such  as  (1  -  Pr^  x  Pr2  , 

(1  -  Pr^  x  (l  -  Pr2)  ,  Prj  x  (1  -  Pr2)  ,  for  two  arguments  and 

similarly  for  three  or  more  agruments,  by  formal  expansion,  where 
1  (A)  =  l  for  all  A  e  B  ,  etc. 

~X  1 

For  example, 

(1  -  Pr x )  x  Pr 2  =  1  x  Pr2  -  P^  x  Pr2 

where  it  should  be  noted  for  any  A,B  e  B^  , 

(  (  1  -Pr  j  )  x  Prz)(AxB)  =  (lxPr2)(AxB)  -  ( Pr^Pr^  )  ( AxB  ) 

=  Pr2(B)  -  Pr12(AxB) 
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=  Pr12(CAxB)  , 


and  for  all  A , B  €  B  : 


((1-Pr1)  x  ( 1-Pr2) ) (AxB)  =  ( lxl-Pr^x l-lxPr2+Pr^xPr2 ) (AxB) 

=  1  -  Prx(A)  -  Pr2(B)  +  Pr12(AxB) 

=  Pr12(CAxCB)  . 

It  can  be  verified  that  here  x  is  a  legitimate  cartesian  product. 
Note 

C  <G  e  Apri>  =  <G  e  C  A^^ 

=  <G  €  Al-Pr1> 

=  <CG  6  Ap  > 

1 

Note  for  any  A , B  e  B1  , 

At  B  =  C (CA  x  CB ) 

=  (A  x  R)  U  (IR  x  B)  , 

A  x  B  =  (A  x  K)  n  (R  x  B)  , 

Pr12(A  t  B)  s  Pr^A)  +  Pr2(B)  -  Pr12(A  x  B) 

(modular  form)  s  (Pr1  x  1) (A  x  B)  +  (1  x  Pr2)(A  x  B)-Pr12(A  x  B)  . 
(DeMorgan  form)  a  (1-  (1-Pr^x  (1-  Pr2))(A  x  B) 


Define  for  any 


a  =  Pr1  or  1  -  Pr^  , 

P  =  Pr2  or  1  -  Pr2  , 
at/J=(axl)  +  (lxyB)-ax/J. 


Thus , 


(Pr1  t  Pr2) (A  x  B) 

=  Pr12(A  t  B)  ,  etc. 

So  t  is  a  legitimate  cartesian  sum. 

Thus,  x  and  t  form  a  DeMorgan  and  modular  system  here. 

Prj  t  Pr2  =  (Pr1  x  1)  +  (1  x  Pr2)  -  P^  x  Pr2  , 

noting  Pr1  x  Pr2  =  Pr12  • 

In  particular,  if  $5,  =  min  and  Pr1  =  Pr2  , 


[ Pr j  r  Pr ^ ) ( A  x  B)  =  Pr12(A  t  B) 


=  Pr^A)  t  Pr2(B)  -  Pr12(A  x  B) 
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Pr12(A  x  B)  =  Prj(A  n  B)  , 

(Pr1  t  Prx)(A  x  B)  =  Pr12(A  t  B)  =  Pr^A  u  B) 

Use  the  notation 

corab( c .  x,  t ) (Prj ,Pr2 , . . . ,Pr  ) 

to  Indicate  some  combination  of  prob.  measures,  as  for  example: 
(1  -  Pr2)  x  (Pr2  +  (1  -  Pr3))  x  Pr4  . 

Theorem 

For  any  probability  spaces  (X^,  A Prj^  '  3  =  l,...,n 
any  A  ^  e  ,  j  =  1 ,  ,  .  .  ,n 

comb  (  C  i  x,  t  )  (Pr^  .  .  .  ,Prn)  (A1#  .  .  .  ,  An) 

=  PrjX- • • xPrn(comb( C ,  x,  t ) (Ax , . . . ,A  ) )  . 

(Note  that  Pr.x. . ,xPr  =  Pr,  .) 

x  n  l  § «  •  ■  0  n 

Thus ,  e . g . , 

((1  -  Pr1)  x  (Pr2  t  (l  -  Pr3))  x  Pr4)(A,  B,  C,  D) 

2  Prl,2,3,4(CA  x  (B  r  cc)  x  * 

Examples: 


(Pr1  x 

Pr 2 )  ( A 

X 

B) 

Pr12(A  x  B)  ; 

(CPrx  x 

Pr2)  (A 

X 

B) 

= 

(1  -  Prj  x  Pr2) (A  x  B) 

= 

(1  x  Pr2)(A  x  B)  -  Pr1  x  Pr2(A  x  B) 

= 

Pr2(B)  -  Pr12(A  x  B) 

= 

Pr12 ( CA  x  B)  ; 

(CPr  x 

CPr2)  (A 

X 

B) 

= 

( (1  -  Pr:)  x  (1  -  Pr2) ) (A  x  B) 

= 

1  -  P r 2 ( B '  -  Pr:(A)  +  Pr12(A  x  B) 

= 

Pr12( CA  x  CB)  ; 

( Pr  :  t 

Pr2)  (A 

X 

B) 

= 

Prx(A)  +  Pr2(B)  -  Pr12(A  x  B) 

= 

1  -  ( (1  -  Pr:)  x  (1  -  Pr2) ) (A  x  B) 

= 

Pr12(A  t  B)  ; 

(CPrx  t 

pr2)(A 

X 

B) 

= 

Pr12(CA  +  B)  ; 

and 
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((CPr1  t  Pr2)  x  (Pr3  t  Pr4))(A  x  B  x  C  x  D) 

=  ((CPr1  +  Pr2  -  CPr2  x  Pr2)x(Pr3  +  Pr4  -  Pr3  x  Pr4))(A  x  B  x  c  x  D) 
=  (CPr1  x  Pr3  +  CPra  x  Pr4  -  CPrj  x  Pr3  x  Pr4 

+  Pr2  x  Pr3  +  Pr2  x  Pr4  -  Pr2  x  Pr3  x  Pr4  -  CP^  x  Pr2  x  Prg 

-  CPrx  x  Pr2  x  Pr4  +  CPr1  x  Pr2  x  Pr3  x  Pr4)(A  x  B  x  C  x  D) 

=  Pr13(CA  x  C)  +  Pr14(CA  x  D)  -  Pr134<CA  x  C  x  D) 

+  Pr23(B  x  C)  +  Pr24(B  x  D)  -  Pr234(B  x  C  x  D) 

-  Pr123(CA  x  B  x  C)  -  Pr124(CA  x  B  x  D) 

+  Pr^  234 ( C A  x  B  x  C  x  D) 

=  Pr1234( (CA  t  B)  x  (C  t  D) )  . 


II.  By  the  bijection  between  probability  measures  and  c.d.f.s 
over  ,  q  2;  1  ,  we  consider 

T(Rg)  ^  {A  |  A  e  sf  (X)  &  =  F  where  F  is  a  c.d.f.  over  Rg) 

Pick  not,  &,  or  to  have  their  ordinary  meanings,  so  that  not 
is  involutive,  &  ,  or  are  symmetric,  associative  distributive,  etc. 
Let  <f>&  be  an  r-copula  (not  necessarily  associative  nor 

symmetric).  Then  by  Sklar's  Theorem  (2.3.6),  for  any  A ^  e  s(R) 

(here  *  Fj  c.d.f.)  ,  x^e  R  ,  j  =  l,...,r  , 


^A  x • • • xA  (X1 ' 
j.  r 


,xr)  =  FjX* • *xF  (x^, . . . ,xr) 

“  {X1)  '  *  '  •  '^A  (xr}  5 

l  r 

=  ^(^(Xj)  ....  *Fr(xr)  ) 

=  II  <(Xj,  .  .  .  ,Xn)  €  A^x  •  •  •  xAr>ll  , 


is  a  legitimate  c.d.f.  -  as  a  function  of  x  =  (x1#...,x  )  e  Rr  . 

Conversely,  (by  Sklar's  Theorem  ),  any  c.d.f.  F  over  Rr  may  be 
written  as,  for  any  x  =  (x1,...,xn)  e  Rn  , 


F(x1,...xr)  =  ♦&(F1(x1) , . . . ,Fr(xp) ) 


where  (f)&  =  F  ( F1  *,...,  Fr*  )  is  an  r-copula  and 

F.  =  F(+®  -ho  ,...,+• )  is  the  marginal  c.d.f.  of  F  , 

j  =  1 ,  .  .  .  ,  r  . 

Define  t$>Qr  to  be  the  modular  complement  of  ,  i.e.,  for 

anV  11  x.  ■  "r  *  (0,1]  , 
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. "‘S'V 

Thus,  for  all  u1,u2,u3  e  [0,1] 

♦or^l*  =  U1 

♦or^l.V  =  U1  +  u2  "  VU1'V 

^or  ^U1  'u2  *u3^  =  U1  +  u2  +  U3  ‘  VU1'V  “  VU1'U3> 


r&'  1'  3' 


-  4>&(u2,u3)  +  V’V'VV  ,  etc. 

Define  negation  operator  as  simply 

<f>nQt(u)  =  1  -  u  ;  all  u  e  [0,1]  . 

We  can  interpret,  for  any  x.  «  R  ,  A.  e  sf(ir.)  ,  *  =  F. 

J  J  Aj  3 

c.d.f.,  corresponding  to  r.v.  :  Q  -*  R  where  (n,a,Pr)  is  a 

fixed  probability  space  and  left  ray  R  =  (-»,x.],  j  =  l,...,r  . 

Xj  3 

<x  .  e  A  .>  =  <V.  e  R  >  ,  j  =  l,...,r  and  jointly, 

J  J  J  X  j 

<(xrx2)  €  Ax  X  A2>  =  <(V1,V2)  e  R  x  R  > 

=  <Xj  €  Aj>  &  <x2  e  A2>  =  <VX  €  Rx  >  &  <V2  e  Rx  >  < 

<(x1,x2,x3)  e  Aj  t  A2  t  A3> 

=  <x1  €  A1>  or  <x2  e  A2>  or  <xg  e  Ag> 

=  <V  €  R  >  or  <V  e  R  >  or  <V_  e  R  >  ,  etc. ,  where  some 

1  xl  2  3 

of  the  Xj's  or  A ^ ' s  may  be  the  same. 

Thus,  a  typical  string  may  be  written  as 

s  =  comb(not,  &,  or) (<x,  e  A.  >,..., <x  e  A,  >)  , 

1  J1  m  jm 


where  A  .  ,  .  .  .  ,  A  .  €  {A.. 

J1  Jra  1 


,  A  )  ,  and  hence 
r 


s  =  comb(not,  &,  or)(<V.  e  R  >,...,<V.  e  R  >) 

31  X1  Jm  xm 

=  <(V  ,...,V.  )  e  comb ( C , x , t ) ( R  . Rx  )>  , 

J1  Jm  1  m 

by  the  usual  relations  between  ordinary  complement  C  ,  cartesian 
product  x  ,  cartesian  sum  t  ,  and  ordinary  not,  &,  or,  so  that 


s  =  <(xlf...,xm)  e  comb( C , x , t ) (A  .  ,...,A.  )  , 


so  that 
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I1SII  —  ,  .  .  ,  _  »  i  ( x.  f  »  *  •  i  x  } 

Tcomb{ C , x , t ) ( A .  ,...,A.  )  1  m 

J1 

*  comb(<*W'  V  *or>M>A  {xl)'-‘-'<?)A.  (xmn  ' 

in  general,  and  hence  non-truth  functional  ,  although  ilsil 
depend  on  corab(<fnot,  <p&,  <t>Qr )  • 


Example  1 . 

Let  r  =  2  ,  and  consider  B  as  the  shaded  region: 


(xry2> 


<x2,y2> 


(X2'yi} 


Let  here  m  =  4  ,  with 


A  =  A 
J2 


\  =  F1  ' 


AJ3  =  Aj4  =  A2  '  %  =  F2  '  etC- 

s  =  <Xj  e  A1>  &  (not  <x2  €  Aj>) 

&  <ya  €  A2>  &  (not  <y2  e  A2>) 

=  <(x1,x2,y1,y2)  €  ((Ax  x  CA2)  x  (A2  x  CAj)) 

=  <(V1,V1,V2,V2)  e  (Rx  x  CRX  )  x  (Ry  x  CRy  ) 

•  <<vl'va)  *  <\  -  H  Ry2>> 

-  <(V,  ,V  )  «  R  «  P„  -I  <8  x  a  J  J  (H  x  8  )  u  (8  x  8 

1  *  X1  yi  x2  yl  X1  y2  X2 

=  <V,  €  R  H  R  >  &  <V„  €  R  HR  >  , 

1  X1  x2  2  yi  y2 

with  truth  evaluation 

II  sll  =  F.  x  F.((Rv  H  R  )  x  (R  HR  )) 

1  *  X1  x2  yl  y2 

=  (Fx  x  F2)(x1(yi)  -  (Fa  x  F2)(x2,y1)) 

-  (Fj  x  F2 ) (Xj ,y2 )  +  (Fj  x  F2)(x2,y2) 


Example  2 . 


s  =  (<x1  e  Aj>  &  not  <x2  «  A1>) 


does 
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or  (<yx  «  A2>  &  not  <y2  e  A2>) 

=  <(x1,x2,y1,y2)  e  ( (A1  x  CA2)  t  (A2  x  CA2))> 

-  <(VJ,',2I  *  <\  -1  %>  *  “Yl  -  \l> 


so  that 


ISII  =  (F  x  F  )({R  ^  R  )  t  (R  ^  Rv  )) 

1  *  xi  x2  Y1  y2 

-  (Fi  f  F2)((RXl  h  rx2>  *  <RYl  h  *Sr2»>  • 


In  turn,  by  using  standard  extension  procedures  of  measure,  the 

above  results  may  be  extended  from  propositions  <x  e  A>  to 

<B  e  a>  ,  where  <x  e  A>  =  <R  e  a>  and  =  <t>.  is  the  probability 

X  r  -A 

measure  corresponding  to  c.d.f.  F  and  B  e  b^  ,  the  Borel  field  or 

o-algebra  over  .  Marginals,  conditionals,  independence  all  hold 

as  usual.  Thus  we  identify 


c omb(C ,*,t ) (^F  , . . . )(Rx 
1  m  1 


) 


=  comb(C,x,f ) {F1( . . . ,Fm) (Xj, . . . ,xm) . 


Hailperin  [1021]  has  recently  contributed  a  most  important 
paper  concerning  Probability  Logic  {PL) . 

Firsc,  a  brief  history  is  given  for  the  problem  of  determining 
if  probability  theory  can  be  viewed  as  a  multiple-valued  logic. 
Included  is  the  Reichenbach-Tarski  controversy,  where  Reichenbach 
considered,  formally,  probability  analyzable  as  a  truth  functional 
multiple-valued  logic,  when  conditional  probabilities  are  also 
treated  as  (hidden)  variables  along  with  marginal  probabilities; 
Tarski  taking  a  critical  view  of  this. 

Hailperin’s  approach,  avoiding  the  problems  of  quantification, 
etc,,  is  restricted  to  developing  a  semi-truth  functional  PL  . 

More  specifically,  Hailperin  utilizes  the  normal  disjunctive 
form  available  for  any  propositional  wff  r  in  a  formal  language  i 
where,  say,  FV(y)  =  (Xj,...x  )  ,  for  some  (free)  variables 

x . xn  ,  and  some  n  =  n(P)  2:  1  , 

ip  =  comb  (not,  &,  or)  (x  ,  .  .  .  ,xn) 

=  comb(not,  or)  (&  B  |  B  s  ))  . 


where  for  any  B  =  (x .  ,...,x.  )  ,  say, 

Ji  Jm 


&  B  -  x . 

^1 


& .  .  .  &  x  . 


Then,  generalizing  the  ordinary  concept  of  a  model  (see  section 
2.3.3),  in  effect,  a  probability  model  PM  for  a  collection  of  pro¬ 
positions  {?  . ?r)  with 
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r 

=  U  FV (?  . ) 

°  j=l  j 

is  any  probability  function  PM  over  {&  B  |  B  £  Aq)  . 

In  turn,  if  r  is  any  wff  with  FV!?)  s  Aq  ,  then 

P  (?)  S  z  Pm(&B) 

{ B I BS A  is  such  that  &B  h  v } 

1  o 

where  B  >*  t  is  the  usual  material  implication  in  0-1  logic 
(see  section  2.3.3  (a))  . 

A  word  of  caution  in  the  use  of  Hailperin's  scheme:  One  must 

distinguish  carefully  between  the  primitive  (author's  term)  con¬ 
junctions  and  similar  appearing  but  different  compound  ones.  For 
example,  suppose  that  Aq  =  (Xj.Xj)  .  Then  the  four  possible 

primitive  conjunctions  are, 

Sd3  ,  to,  ,  &x2  ,  &{x1(x2}  =  x:  &  x2  , 

keeping  (on  purpose)  the  &  notation  on  unary  forms,  where  the 
corresponding  probabilities  satisfy 

+  PM^X1^  +  PM^X2^  +  PM^X1  &  x2^  -  1  ' 

But  note  that  for  x^  considered  as  a  compound  expression, 

Xj  =  &Xj  or  (Xj  &  x2) 

*  &x  j  ,  j  =  1,2  , 

and  since  for  ordinary  0-1  logic 

Xj  >»  Xj  and  (x1  &  x2 )  >*  ,  j  =  1 , 2  , 

but  it  is  not  true  that 

0i*  x j  nor  x.  Xj  ;  i  *  j  , 

it  follows  that 

PM(xj}  =  PMl&Xj)  +  PM(X1  &  x2}  '  ^  =  1,2  ' 

and  hence 

PM(x1  &  x2)  s  min  PM(x1(x2)  , 
as  should  be,  etc. 

(Note  the  analogue  of  this  situation  with  that  in  Chapter  9 


(B)  .  ) 


158 


Goodman  and  Nguyen 


It  follows  that  PM  obeys  all  the  usual  laws  for  a  probability 
logic  operator  including,  for  all  8 ,?  e  Wff(t)  ,  x  any  variable 


(a) 

pM(n  =  o 

,  if 

v  (x  &(not 

x) 

) 

(b) 

pm(9)  *  pm 

(?) 

,  if  e  w  ? 

(c) 

PM(not8)  = 

1  - 

pm<9> 

(d) 

PM<0  V  v ) 

=  PM 

(8  )  +  PM(y)  • 

if 

( e  )  w 

is  relative  to 

ordinary  0  - 

1 

logic . 

In  turn,  since  (b)  yields  PM  having  the  same  evaluation  over 

0-1  logically  equivalent  (ww)  wff’s  ,  Hailperin  uses  the 
Lindenbaum-Tarski  probability  algebra  which  identifies  all 
wff's  e  l  which  are  logically  equivalent. 

In  addition,  an  extended  material  implication  is  defined:  For 

any 


(1) 


Z  -  {®0'ei' - Sm}  S  wff (*> 


with,  say. 


(2) 


FV(Z)  =  U  FV(e  )  =  {x  - - x  }  , 

j  =  0  J 


say,  where  n  =  n(Z)  ,  and  hence  we  may  write,  w.l.o.g. 


(3) 


3^  =  0^(x^,..., xn )  ,  i  —  0,1, ...,n  , 


and  for  any 


(4) 

(5) 


aQ  and  «  =  «Vi«l . 


m 


ai  "  [ai.t>1]  S  [0,1]  ;  i  =  0,1, 


,  m 


define  the  (a,aQ)-level  probabilistic  implication. 


(6) 


[9  ,  e  a  .,  ...  ,9  e  a  )  w  .  ( 8  _  €  a- )  , 

1  1  m  m '  prob '  0  O' 


iff,  for  all  possible  probability  models  PM  : 

if  0?M(®  x)  «  “j . PM<V  6  V  '  then  PM<V  €  a0  • 

The  first  basic  result  is  the  reaffirmation  that  the  tauto¬ 
logies  of  standard  PL  and  ordinary  0-1  logic  coincide.  (See 
also  Rescher  [214],  pp.  184-188,  for  more  details.) 


Theorei 


(Hailperin  [102‘],  Theorem  2.1) 


For  any  9  €  Wff ( i )  , 
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M  .  (0 

prob ' 


1)  iff  t»  8  ,  for 


two-valued  logic. 

The  next  result  is  a  further  extension  of  Hailperin’s  gen¬ 
eralization  of  Frechet's  theorem  on  the  bounding  of  probabilities. 


Theorem  (Hailperin  [1021],  Theorem  3.2,  4.2) 

Let  z  £  Wff (z)  be  arbitrary  with  all  notation  holding  as  in 
(1)  -  O)  • 

Then  there  exists  L_  ,  UD  :  [0,l]2m  -*  [0,1]  ,  such  that  for 

D  IJ 

any  aQ  and  a  as  in  (4),  (5)  and 


(i)  for  any  probability  operator  P  (including  PM)  over  t  ,  if 


Pfe^.)  e  ai  ,  i  =  1 ,  .  .  .  ,m  , 


then 


LB(a)  £  P(80)  <  Ly (a )  , 

where  [Lg(a),  Ug(a)]  is  the  tightest  possible  bounding  interval 
for  P(«0)  .  Lg , Ug  are  obtainable  by  solving  a  linear  programming 
problem . 


Ui) 


(«1  ^ 


iff 


a  ) 
m' 


prob 


0  e  a0) 


[Lg(a),  Ug (a ) ]  S  aQ  . 

(iii)  In  particular,  note  that  if  m  =  n  and 

Sj  =  s j<x!' • • • <xn)  =  Xj  ,  j  =  1 . n  , 


then  (ii)  implies 


(Xj  €  Oj , . . 

'  *  y‘n 

e  a  )  t  ,  ( e  _  e 

n'  prob  v  0 

[LB(a) , 

Ug (a ) ] )  . 

Note  that  part 

(i) 

of  the  theorem 

may  be 

interpreted  as  a 

sensitivity  analysis 

for 

P(0g)  ,  when  the 

initial 

0^  (or  x^  as 

in  (iii))  have  probability  values  known  up  to  interval-values.  In 
addition,  note  that  the  upper  and  lower  bounding  functions  and 

Lg  are  truth  functional  in  form.  Indeed,  for  eQ  in  pure  disjunc¬ 
tive  or  conjunctive  form  and  initial  8  i  =  xi  '  Uemma  4  of  the 
Appendix,  section  2.3.6,  shows  equivalently  that  Lr.  and  L„  are 

in  t-conorra  or  t-norm  form,  respectively. 

See  also  Chapter  6,  following  Cor.  1  and  section  5.3  B  for 


160 


Goodman  and  Nguyen 


further  discussion  of  probability  bounds. 

B.  Lebesque-Stieltles  logical  svsteii. 

Use  the  basic  theorem  (McShane  [174]:  see  also  following 

Appendix)  that  if  f  :  Rg  -»  [0,1]  is  of  strong  bounded  variation 

( SBV )  ,  there  is  an  ?  such  that  the  interval  functions  Vr(f,-  )  = 

Vr(?,-  )  and  the  extended  measures  p  =  where  /j  is  a  finite 

r  f  £ 

countably  addi-  tive  complete  regular  signed  measure,  where  w.l.o.g. 


f 


f  + 


f 


f  ,  f  are  nondecreasing  positive  increment  monotone  grounded 
continuous  from  the  right  -  and  hence  are  generalized  c.d.f.'s 
yielding  non-negative  complete  regular  finite  countably 


additive  measures.  We  say  ?  is  equivalent  to  f  . 

Then  replace  in  probability  logic  any  c.d.f.  by  an  SBV 
function  and  in  the  extension  to  the  Borel  set  level,  probability 
measures  by  these  finite  signed  measures. 

The  interpretation  and  truth  evaluation  of  strings  proceeds 
analogously,  except  that  Sklar's  Theorem  no  longer  holds,  in  the 
more  general  setting.  It  is  assumed  that  $<.,  ,  is  sufficiently 

well-behaved  that  comb(<fc  .  ,  <b. ,  *  )  preserves  SBV,  etc. 

not  &  or 


In 

w .  1 .  o  .  g  . 


summary,  given 
as  in  the  basic 


•ny 

theorem, 


7  (IR  J  ) 


"J 


assumed 


CA 


(x. 


and  defining 


1  -  <$>A  (x  )  , 

Aj  ° 


or 

id 


we  can  show  that  the  following  relations  must 


*  X  C  ,A  .  fxl . Kr  J 


j=l,....rJ  j 


=  M  <  &  <x  .  €  C  •  A  .  >n 

j=l,....rJ  J  J 

=  r  (_ncard  5.<p  Six.)  i 

S5(l . rHC  /  Aj  \  3  3  J 

crus  J 


hold 


where 
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Cr  a  {j  I  M  j  s  r  5  C,  =  id)  , 


>tA  <X1 . xr>  =  :!  or 


•  •  *  »r 


<xj  €  W 


j=i 


=  ^  (-Dcard  s+1* 

0*  SS  { 1 . r } 


’  ♦or  1<X]  6  ‘j’’ 

3«s  i*1 . r 


Let  us  consider  Lebesgue-Stielt jes  Logic  in  further  detail. 
Note  that  all  the  theorems  referred  to,  unless  otherwise  notated, 
will  be  found  in  the  Appendix  at  the  end  of  this  subsection. 

Define  the  following  important  spaces: 


ci  cj 

s  =  {f  |  f  is  a  (simple)  step  function  over  Rq} 

Fg  =  <f  I  f 

BVg  =  {f  |  f 
SBVg  ^  {  f  |  f 
og  =  {  f  I  1 


PM 


SPM. 


{f 

(f 


Rq  -*  R  (Bcrel)  measurable) 

Rq  -•  R  &  f  is  BV  (bounded  variation)} 
f  :  Rq  -♦  r  &  f  is  SBV  (strong  bounded  variation)} 
f  :  Rq  -♦  [0,1]  is  a  c.d.f .  } 
f  :  Rq  -*  R  is  PM  (positive  increment  monotone)  ) 
f  :  Rq  -  R  is  PM  5  SBV} 


O  IR^ 

Recall  the  spaces  of  all  dispersions  over  R  ,  [0,1]  ,  and 

Rq 

{0,1}  ,  the  space  of  all  ordinary  set  membership  functions  over 

Rq. 

Thus ,  e . g . , 

pq  pq 

(0,1)  S  [0, 1]  n  iFg  , 

Rq  Rq  Rq  R?q 

s  n  [0,1]  ,  o  s  sbv  n  [o,i]  s  bv  n  [o,l]  s[o,l]  of 

^  CJ  ^  ^ 


By  Theorem  2,  e.g.,  Sg  n  [0,1] 


and  hence  sBVg  n  [0,1' 


and  bv  n  [0,1] 
q 


are  all  dense  w.r.t. 


[0,1] 


n  f 


over  arbi- 


t,t]q  ,  t  >  0  in  L^-norm.  By  Theorems  6-8, 


trary  q-intervals 
we  may  always  identify  (or  need  only  consider) 


BVq  =  (f 


SBVg  =  (f 


f  :  Rq  R  is  a  g.d.f.  (generalized  distribution 
function) }  , 


R 


R  is  a  SBV  g.d.f.)  , 
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PM  =  {f  I  f  :  R 

q 


is  a  PM  g  .d . f . }  , 


SPMq  =  (f  1  f 


Rq  -  R+  is  a  SBV,  PM  g.d.f.}  S  D 


Recall  c  is  the  class  of  all  q-copulas  ,  a  subclss  of 

q  oc 

all  conjunction  operators.  Then  Sklar's  Theorem  may  be  reformu¬ 
lated,  involving  in  a  key  way  generalized  cartesian  products, 

0  =  U  Ex.  ••x.  D,  ,  all  q  £  1  , 

q  ct.ec  1  1 

q  ' - v - ' 

q-f actors 

where  for  any  q-copula  . 


0 .  x 


x  D  S  {f  x  --x  f  |  f  e  D  j  =  1 , . . . , q}  . 


Define  the  q-right  closed  ray  class  as 
Ray  =  (J(-®1  •  Y)  I  Y  €  Rq) 

u  4 

and  note  the  bijective  mapping  ray  :  *— *  Ray  >  where  for  all 

4  4 

y  e  Kq  ,  ray  (y)  =  J(-«°-l  ,y)  .  (We  will  interchange  often  ray 

q  q  q 

and  Rayq  . ) 

Define  the  class  of  all  complete  regular  probability  measures 
over  0  by 

q 

?rq  =  |  fj ^  ,  Bq  -*  [0,1]  is  the  unique  complete  regular 

probability  measure  induced  by  f  ,  for  any 
f  e  0q)  ,  all  q  z  1  . 

It  follows  from  bijection  ray_  and  the  standard  construction 

q 

of  complete  regular  probability  measures  ^  ^  associated  with  given 

c.d.f.  f  (a  special  case  of  the  Lebesgue-Stielt jes  measure  con¬ 
struction)  that  a  bijective  relation  exists  between  0q  and 

*r  .  u  1  0  « — »  9r  ,  where  for  all  fen  , 

q  *  q  q  q 

f )  =  . 

and  where 

fj  .(ray  ( y )  )  =  f(y)  ;  all  y  €  Rq  . 

*  4 


In  turn,  it  follows  that  Sklar's  Theorem  yields 
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9  r  =  u 
q  $  €C 
T&  q 


(»r  x 


■  x  .  ?r,) 
1 


all  q  i.  1  , 


where  for  any  q-copula 


*r  .x 
1  tr 


x*  a  {p.  x 


T&  q 


fj  e  ,  j  —  1 » ■ • • < q)  < 


x<p  • 

1 


S  ^x.  • 

«  q  i  4>& 


V  .  A 

q 


is  consistently  defined  by  the  unique  extension  property  of  proba¬ 
bility  measure  given  any  c.d.f.  f  . 

By  considering  co-Cg  ,  the  class  of  all  DeMorgan  q-cocopulas, 


all  possible  generalized  cartesian  sums 


may  be  obtained, 


generated  by  $or  e  co-c 
strings  of  the  form  f^t 


Dual  to  the  above  development  for  x.  , 

•t  f  may  be  found  as  well  as  these 
™or  q 


involving  C  ,  noting  for  all  f  e  o 


C =  1  -  Pf  =  Pf(C' 


and  finally,  strings  involving  arbitrary  mixed  finite  combinations 
comb (  C  ,  x  ,t  )  (or  equivalently,  comb( , 4>or )  )  •  Special 

vor 

cases  of  these,  as  in  the  development  of  general  logical  systems, 

include  the  definitions  for  n  ,  u  ,  A  ,  s  ,  £  ,  etc.,  all  depending 
on  particular  choices  of  truth  functional  system  9  =  (<}>not ,  <t>& ,  <}>or )  • 

Thus  a  probability  logic  in  the  extended  sense  may  be  considered  as 
the  class  of  all  finite  strings  involving  a  fixed  9  and  m^'s  . 


It  is  required  that 


be  symmetric  and  associ¬ 


ative  operations,  and  hence  unambiguously  extendable  to  any  finite 
number  of  arguments. 

In  general,  a  probability  logic  is  not  truth  functional  - 
although  clearly  restricted  to  the  left-ray  level,  since  it  reduces 
to  all  finite  strings  of  c.d.f.  's  involving  J  ,  it  is  truth  func- 
ional .  Let  us  consider  this  in  some  more  detail: 


Let  f  e  in 


g  e  Dr  ,  and  define 


c^f  **  -  (1  ~  Hm)  x.  «  S  l(q)  x.  » 


t  9 


V®  f  9 


where  l(q)  :  R  -*  { 1 }  ,  and  l(q)  x  is  the  embedding  of 

A'g  :  -*  [0,1]  into  Bq+r  #  where  for  all  B  €  b^  ,  C  €  B^ 
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U(q)  >,All«  »  C)  -  „(»  x  0)  -  „g(C> 

“  <P& 

Define,  similarly  pf  x^  l(r)  .  Also,  define 


1  (q)  x  1  ( r  )  ( B  <C)=1, 


Note  that  Cpf  is  not  to  be  confused  with  p^f  ,  since  for  all 
B  €  E 

q 

Pcf(B)  =  Ml-f(B)  =  ^_f(B)  =  “A'f  (B) 

from  the  Lebesgue-Stielt jes  construction,  noting  1  -  f  is  not 
grounded  but  is  equivalent  to  -f  which  is  grounded  (see  the  Appen¬ 
dix)  .  On  the  other  hand,  note  Cp^  is  not  any  type  of  measure  (let 

alone,  a  probability  measure)  and  thus 
Cpf(B)  =  1  -  pf(B)  *  Pcf(B)  . 

Also,  define,  for  B  e  Bg  ,  c  €  arbitrary 

(Cpf  x  Cp  ) (B  x  C) 

=  (d  "  Pf)  x  (1  -  P_)){B  x  C) 

y 

=  { 1  (q  +  r)  -  Pf  x^  l(r)  -  1  (q)  x^  +  pf  x^  pg)  (B  x  C)  . 

-  u  -  Pf I®)  -  Pg(C)  +  pfx  g)(B  x  C), 

=  pfx  g(CB  x  CC)  . 

Similarly, 


(Pf  t  Pg) (B  x  C) 

=  C (Cpf  x  CP  ) (B  x  C) 

“  <A*f  xd>  1<r>  +  x<h  “  A»fx  q )  ( B  x  C) 

&  T&  a 


=  pf(B)  +  p  (C)  - 
"  ^fx.  g<B  »  C>  ' 


^fx.  g  ( B  x  C) 


where 

B  t  C  =  (B  X  Rr)  U  (Rq  x  C)  ,  B  x  C  -  ( B  x  Rr)  n  ( Rq  x  C)  . 

Compatability  between  t  and  $or  and  C  will  be  achieved  by 
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choosing  5  to  be  any  Frankian  or  modular  logical  system  (see 
section  2.3.6)  .  Then  let  t  = 

^or 

More  generally,  if  comb(C,x>.  ,  t.  )  denotes  any  given  corn¬ 
ea  9or 

bination  of  probability  operators  which  can  be  applied  (in  a  well 
defined  manner)  to  any  collection  of  probability  measures 

-♦  [0,1]  ,  j  =  l,...,n  ,  say,  then  it  can  be  shown  that  the 

evaluation  of  the  resulting  (finite)  string  at  any  cartesian  product 
(or  cylinder)  set  is  obtained  by  replacing  all  probability  operators 
by  the  corresponding  ordinary  set  operators  in  the  comb  function  and 
in  its  arguments,  each  probability  measure  by  a  corresponding  carte¬ 
sian  product  factor  relative  to  the  joint  probability  measure.  That 
is 

(comb(C,x  ,T  )  (p  )  )  ( B,  x  •  *  xB  /  (i) 

?or  1  n 

=  (/if  x<b  "*&  V f  )  (  (comb(C  ,x,t  )  )  (Bj  ,  •  •  ,B  )  )  , 

for  all  B.  €  b  ,  f .  e  B  ,  j  =  1 , . . . ,n  ;  n  k  1  . 

J  qj  3  qj 

Furthermore,  if  this  result  is  specialized  to  B.  =  ray  (Y.)  , 

J  qj  J 

q1 

Yj  €  R  »  j  =  l,...,n  ,  arbitrary,  one  readily  obtains 

( comb  ( C ,  x  ,  t  )(fjf  . pf  ))(B1x.-xBn)  (ii) 

^St  ”or  1  n 

=  co»b-«|.not.«&,tor)(f1{y1) .  fn(Yn)) 

=  ^“h'^not’V^or^f  « 

1  n 

a  truth  functional  system,  when  B^'s  are  restricted  to  rays.  But, 

for  more  general  Bj's  *  even  for  n  =  2  ,  probability  logic  is  not 

a  truth  functional  system  since  there  is  no  function  r  such  that, 
e.g.  , 

x  (j  )(B  x  B  )  =  r{fj  (B  )  ,  p.  (B.))  , 

12  1  2 


for  all  B.  «  B 

j  q. 


.  j  =  1,  2 


(See  the  Preface  for  a  specific 


example.  See  also  Rescher  [214]  for  further  properties  of  pro- 
babibity  logics.)  The  development  of  the  concepts  of  marginal 
probability  measures,  statistical  independence  and  conditional 
probability  measures  are  all  well  known  and  will  be  omitted  here. 
(See,  e.g.,  Neveu  [184'].) 

Motivated  by  the  above  development  for  Probability  Logic,  let 
us  reconsider  Lebesgue-Stielt jes  measures  and  Lebesgue-Stielt jes 
Logic: 

First  note  that  with  respect  to  pointwise  addition  and  scalar 
multiplication,  SBV^  is  a  vector  subspace  of  vector  space  BV^  ,  in 

the  sense  that  for  any  n  i  1  and  scalars  Ai,-**'An  €  R  •  and 


f f  «  SBV  (or  €  BV  ) 
1  n  q  q 


Vj  *  SBVq 


( or  «  BVg  ) 


> 
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Denoting 

arq  "  I  f  (9-c-d-)  €  BVq>  • 

where  \jl  ^  now  represents  a  possibly  signed  measure  corresponding  to 
f  *. 

yar  3  {fj  |  f  €  sbv  }  , 

u  *  M 

A  IR1^ 

arq  =  {fJf  I  f  €  BVq  n  C0'1!  >  ' 


y»rg  s  [fj{  |  f  «  SBrg  n  [0,1]  }  , 


it  follows  that  for  A 


,A  €  r  ,  fj  €  yar  (or  ar  ) 

lnq  4 


iV/fj  =tJ  n 

J  3  Z  A  f 

j  =  l  3  3 


e  yar  (or  €  ar  ) 

q  q 


and  hence  yar  is  a  vector  subspace  of  vector  space  ar  w.r.t. 

q  q 

scalar  multiplication  and  addition  as  defined.  Similarly,  since  for 

n  a  a 

0  <  A A  £  1  ,  1  A.  =  1  and  any  /j,  . A',  e  ya  r  (or  e  ar  )  , 

j  =  l  J  In4  4 


Z  A  m  =  » 
j=l  J  J  n 


e  yar  (or  «  ar  ) 

q  q 


r  a  f 
j=l  J  J 


Hence  yar  is  a  positive  convex  cone  of  yar  and  ar  is  a 

4  4  4 

positive  convex  cone  of  ar^  ,  noting  that 


ar  s  yar  ;  all  q  2:  1  .  etc. 
q  q 


Metrics  and  norms  can  be  introduced  on  any  of  the  above  spaces, 
using  the  relation  for  any  , 

dist  IfJf.fJg)  =  dist  (  f  ,  g)  , 

where  dist  may  be  determined  by  any  appropriate  function  space 
1  2 

norm  such  as  L  -  ,  L“-  ,  or  sup-norms. 

Analogous  to  the  Probability  Logic  development,  let  us  next 
enlarge  ar  by  the  introduction  of  constants  c  :  B  -»  (c)  which, 

q  q 

unless  c  =  0  ,  are  non-measures; 

carg  3  (c  +  /vf  |  A/f  e  arg  ,  c  e  r  } 

2  cyar  -  {c  +  u ^  |  u  e  yar  ,  c  e  R)  , 
q  f  f  q 
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where  c  e  R  is  identified  with  c  -»  {c}  and  c  +  A'f  ®q  -»  R  ' 

(c  +  pf)(B)  ^  c  +  pf(B)  ,  for  all  B  e  Bq  . 

Again,  carq  is  a  vector  subspace  of  cyarq  under  scalar 
multiplication  and  addition.  In  particular,  consider  any 

A  A 

)jf  €  arg(y*rq)  .  Then  -pf  =  e  *rq  (yarg)  and 
l  -  u,  «  car  (cy»r  )  . 

We  can  denote  1  -  p ^  as  C/J^  ,  the  generalized  complement  of 
since  for  any  y  e  Rq  , 

(1  -  /jf )  (rayg(y)  )  =  1  -  f(y)  =  Cf{y)  • 

However,  if  /j  ,  e  cyar  then  for  all  Bee  , 
f  q  q 

(1  -  Mf )  (B)  =  *»f  (CB)  +  1  -  nt  (Rq) 

which  will  coincide  with  a  measure  complement  only  when,  e.g.,  f 
is  a  c.d.f.  and  hence  is  a  probability  measure  (and  thus 

1  -  P<f(Rq  )  =  0  )  . 

Note  as  in  the  probability  logic  case,  for  any 
f  e  Brqn  [0,1]^, 

Rq 

Cf  *  l  -  f  «  BVg  n  [0,1]  and 
^Cf  =  »-{  =  “  ft  ' 

and  for  any  g  :  Rq  R  Borel  measurable, 

|  g(x)d^cf(x)  =  -  J  g(x)dpf(x)  , 

xeRq  xeRq 

and  that  Cf  is  not  grounded  In  general,  especially  when  f  is  a 
g.d.f.  Hence,  again 


"Cf  =  “"f  *  1  "  "f  =  c^f 


Now,  analogous  to  Sklar's  Theroem  mentioned  previously,  define 
lor  any  nil 


u 

q- 1  - 

j=l , . • • , n 
,  4>r€C 

q 


*1  *1 
(SBV  n  ( ( SBV  n  [0,1]  )X  ..x  (SBV  n  [0,1]  ) 

q  ql  9n 


Sufficient  conditions  (for  the  cases  q  = 


=  qn  =  1)  for 
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membership  in  1  /-  given  in  Theorem  12.  (It  would  be  of  some 

interest  to  develop  practical  necessary  and  sufficient  conditions 
for  membership  in  Cn< ) 

As  in  the  Probability  Logic  case,  the  more  general  Lebesgue- 

_q  a 

Stieltjes  development  established  that  p  :  SBV  n  [0,1]  «— »  *»r  , 

'■d  4 

d. 

p(f)  =  •  all  f  €  sBVq  n  [0,1]  is  bijective  where  again 

P  (ray  (Y) )  =  f(Y)  ;  all  Y  e  Rq  , 
and  in  turn 

C  (n)  4  <"f  I  -•  *  V 


f  q  2 ,  j=l ,  .  .  .n, 

l  Vs, 


(yar  n  (yar  x,  •  •x.  yar„  )) 
q  ql  qn 


Thus,  the  cn‘s  are  the  classes  of  relevant  Lebesgue-Stielt jes 
measures  which  arise  as  generalized  cartesian  products. 

qj 

Note  that  for  all  y  «  R  J  ,  j  *  l,...,n, 

(Pf  x--xp  ) ( ray  (Yt>  x- • x  ray  (y  )) 

1  n  ql  4n 

V - * - ' 

=  rayq(yix..xyn) 

=  (fl  >Vxfn)(yl . V 

6c 

=  VW . fn(yn)} 

=  (ray  (y.  )  ) . pf  ( ray,.  (y  )))  . 


q^  1 1 


q  -  n 


analogous  to  the  Probability  Logic  case. 

On  the  other  hand,  eq .  (i)  does  not  in  general  carry  over  here. 
Also,  since  probability  logic  is  not  truth  functional,  Lebesgue- 
Stielt  jes  logic  is  not  also,  at  the  Borel  set  level  -  even  over 
cartesian  products  of  Borel  sets. 

A 

Of  course,  all  evaluations  for  any  p^  «  -rarq  «  e.g.,  may  be 

carried  out  by  use  of  the  minimal  decomposition 

f  =  (f)+  -  (f)~ 

(f)+  ,  (f)  being  strong  BV  PM  g.c.d.’s.  Thus,  for  any  B  e  , 


P  f ( B)  =  p  (B) 
(t) 


P  _  {  B  )  , 

(f  ) 
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etc . 

Analogous  to  the  Probability  Logic  case,  define  also  t. 

“or 

Again,  as  in  the  Probability  Logic  case,  it  follows  immedi¬ 
ately  that  C-  ^  ,  t  (at  both  the  left  ray  and,  more  generally, 

”  ik  ®  & 


Borei  o-algebra  level)  are  unambiguously  well-defined  eneralized 
complements,  cartesian  products  and  sums,  i.e.,  symmetric,  asso¬ 
ciative,  min  bounded  above  for  x  ,  max  bounded  below  for  + 

T*  T> 


and  having  marginal  properties  w.r.t, 

Define  also  for  c yar  ,  e.g., 

q 


unity  and  zero,  respectively, 
generalized  cartesian  products 


distributive-wise,  analogous  to  the  Probability  Logic  case,  except 
that  l(q)  is  replacable  by  a  more  general  constant  c{q)  .  Thus 
(noting  again,  c  =  0  reduces  to  the  pure  measure  situation)  for 
any 


p/ ,  ice  eyar  and  p.  ±  c„  e  cyst  , 

r  1  1  ^3  *  ry  *  * 


<Pf  *  ci>  xd,  <Pf  *  c2) 
1  2 


=  pf  *4,  p t  ±  ci(q)  x(f&Pf 2  1  pfi  xtf,&c2(r)  +  ci 

=  Pflx*  f2  *  <t5)x*  f2  1  PflXa>  C2(r)  +  Cl(q) 


(q)  •  c2(r) 
c2(r)  , 


Similar  expansion  definitions  hold  for  C  and  t .  ,  so  that  again 

"or 

arbitrary  finite  strings  of  comb(C,  x  ,  t  )  may  be  defined 

?or 

unambiguously  upon  those  p/^'s  «  yard's  yielding  cartesian  products 
in  some  C 

n 

For  example  for  any  g.c.d.,  f  and  g  ,  the  following  DeMorgan 
and  modular  property  of  t  and  x  hold: 

Pf  1  Pg  =  1  -  d  -  Pf )  *<p&  U  -  Pg) 

=  1  -  (1  -  /if  X  1  -  1  X  Hg  +  fJf  X  fjg) 

“  Pf  X  1  +  1  *  Pg  ~  Pf  *  Pg 
=  "fig  ’ 


where  $or  may  be  chosen  modular  and  DeMorgan  w.r.t.  :  For  all 

x.Y  «  [0,1]  , 


(f  t  g)  (x,y)  =  $or  (  f  (x)  ,  g  ( y) ) 

=  f ( x )  +  g(y)  -  $&(f (x) ,g(y) )  , 

i.e.,  7  =  (<fnot>  $£<  ?or^  is  Franklan  (see  2.3.6),  as  in  the 

probability  logic  development. 
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As  a  check  on  the  associativity  and  symmetry  of  tor  '  note 
that ,  e . g . , 

bf  t  (Pg  f  Vh)  =  1  -  (U  -  Pf)  x  (1  -(  fJg  t  fjh))) 

=  1  -  (1  -  fJ{)  x  (1  -  vg)  x  (1  -  ^h) 

=  (Pf  t  pg)  f  . 

Returning  to  marginal  measures,  let  f  .  e  SBV  n  [0,1]  , 

J  qj 

j  =  l,..,,n  .  Suppose  also  that  each  fj  is  normabie,  i.e.,  there 
exists  e  R  ^  u  { -m» l  }  such  that 

"  j 

1  =  fj(Xj(0))  =  sup  g  f-jfx^.)  ;  j  =  1 . n  . 

x  .€R  J 
J 

Then  for  any  0  *  C  £  {l,.,.,n}  ,  the  C-marginals 
qi 

(f  «"xf  )  :  r  J  -♦  [0,1]  are  all  well-defined  strong  BV  g.d.f. 

1  n  c 

functions,  where  q„  =  2  q.  and  where 

C  j€C  j 
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=  *  f-|(Y-|> 

j€C  J  J 

-  {fix--xfn’c<y)  • 

Hence,  for  all  0  *  C  £  {l,..,,n} 

<'V"*fn)c  -  "<V--Xfn>0-  j€XcfJ  ' 

Let  us  call  this  condition,  marginal  consistency. 
For  given  ,  marginals  p  ^  ,  .  .  . 


are  thus 


<p&-i  ndependent  or  non-1 nteract i ve  w.r.t.  joint  measure 

^f,x  •  •  x  f 
1  n 

IRq 

More  generally,  if  f  «  $BV^  n  [0.1]  is  normable  with 
0  *  C  £  {l,...,q}  such  that 


sup  f(x  ,...,x  )  occurs  for  x.  =  x  '  ' ,  all  j  e  l,...g}  H  C  , 
<ru?  -1  i  ^  J  J 


xjelK  . 

Je{l,  ,  .  .  ,  q  H  C. 


not  depending  on  any  x.  ,  j  «  C  ,  then  the  C-marginal  function 

fc  :  Rcard  C  -»  [0,1]  exists  and  w.l.o.g.  is  a  g.d.f.  strong  BV. 

This  induces  marginal  measure  :  ®card  c  -♦  [0,1]  w.r.t. 

C 

^f  :  ®q  **  [0,1]  .  (In  the  case  of  f  being  a  c.d.f.,  one  can  choose 

I  o  \ 

x j v  '  =  +»  ,  j  e  c.)  Again,  it  follows  that 

(^f)c  -  H f  • 

C 

If  0*C,D£{l,...,q}  with  cnD=0  and  marginal 
g.d.f.'s  fc  and  exist,  then  fc  and  fQ  -  and  equivalently 

and  w.r.t.  -  are  said  to  be  (^“independent  or 

C  D 

nor.-interactive  w.r.t.  f  iff 

f  —  f  X  f 

cud  1  c  <p&  r D 

(up  to  appropriate  ordering  of  components),  i.e., 

=  Mf  x*  ^f  • 

CUD  C  D 


noting  f^  ,  fD  ,  are  all  normable,  since  f  is.  Conversely, 

if  only  f  =  fc  x  fD  ,  is  required,  fQ  ,  f  normable,  then  as  in 

the  previous  development  f  and  f_  are  appropriate  marginals, 
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etc . 

Consider  now  any  t-conorm  (or  t-co-copula )  <f>or  .  Since  sup 
is  a  special  case  of  $or  ,  the  following  extends  the  previous 

concepts  for  marginals:  Let  G*^  c  be  at  most  countable.  Let 

f  :  -»  [0,1]  be  a  strong  BV  g.d.f. 

Again,  let  0  *  C  £  { l,...q }  be  given.  Define  the 

c  3.  f  d  C 

C-<p  -projection,  proj($  )  _  ( f )  :  G  -  [0,1]  for  a  given  truth 

or  o  r  u 

functional  system  ($not«  <P&>  4>or> 

Proj(<|>or)c(f  )  (  (Xj)  jec)  =  4>or  <f(xl . xq)}  ' 

Xj  «  G 

.j«(i . qHc, 

(without  requiring  the  ^or  process  to  achieve  its  limit,  indepen¬ 
dent  of  the  remaining  x^'s)  •  Then  define  for  any 

0  *  C,D  s  { 1 ,  .  .  .  , q}  ,  C  n  D  =  0  ,  pro  j  ( 4>or ) c (  f )  and  pro j  ( <for  )Q (  f ) 
to  be  $or  -  <?>&  independent  or  noninteractive  w.r.t.  ^cuD  (and 
similarly  projection  measures  ^pro  j  ( Q  or )  (  f  )  and  ^pro  j  ( <j>  )  (  f  ) 

w.r.t.  fj  )  ,  iff 

CUD 

f CUD  =  Pr°J(1>or)C(f)  proj(tor)D(f)  . 

u 

and  hence 

MfCUD  =  Vojttor-Jc^X  MPr°j<<l>or)D'f) 

A  natural  question  to  ask  here  is  what  is  the  analogue  of  the 

marginal  consistency  relation?  Suppose  f  is  normable.  Then  since 

k  max  pointwise,  it  follows  that  proj  (<?>  )  (  f )  and 

or  o r  o 

proj  ( $  )  q  (  f )  are  normable.  Suppose  also  that  is  right  dis¬ 

tributive  over  <t>  .  (Examples  of  this  include  (min, max)  and 

or 

(prod, max).)  Then  it  follows  that, 

Pr°j(<t-or)C(fCUD)  (X) 

=  4>&(pr-oj($or)c(  f  )  ,  $or  (proj($or)D(f  )  (y)  )  ) 

yeGD 

=  4>&(  proj  (<f>or  )c(  f  )  -  l ) 

=  proj(^or)c(f )  . 

A  similar  projection  consistency  relation  holds  for  pro j ( $  ) D ( f )  . 

Note  the  relationship  between  truth  functionality  and 
■^-independence :  Even  when  f  and  are  ^-independent  w.r.t. 
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,  e.g.,  truth  functionality  still  does  not  hold  for  u, 

CUD  fCUD 

relative  to  p  and  pf  in  general,  unless  =  prod,  in  which 

rC  rD  “ 

case  (for  Probability  Logic)  statistical  independence  and 
independence  coincide. 

More  generally,  for  a  given  logical  system  (C,x,t)  ,  define 

projection  by,  for  f  and  C  as  above, 


pro j (t ) c ( f ) ( (Xj)j€C) 


t  f (x1 , . . . ,x  )  , 

x  .e  G  q 

1 

Je  (i, - qHC. 


completely  analogous  to  the  truth  functional  case. 


Conditioning 

Recall  that  if  X  and  Y  are  any  sets  and  f  :  X  x  Y  [0,1]  , 
g  Y  -♦  [0,1]  with  f  dominated  by  g  pointwise,  i.e., 

f(x,y)  g(y)  ;  all  x  e  X  ,  y  e  Y  , 

and  is  a  given  continuous  t-norm,  by  the  Darboux  Theorem  there 

is  a  class  of  functions  <f  (• |y)  |  f  (• iy)  :  X  -  [0,1]  ,  y  e  Y) 

such  that 


f(x,y)  =  { fy(x)y)  ,  g(y))  ;  all  x  «  x  ,  y  e  Y  . 


If  is  strict  Archimedean  ,  then  for  each  y  € 


will  be  uniquely  determined.  Each 
conditional  dispersion  of  f  given  y 


fY(*l*V, 


w .  r .  t . 


Y  .  f  y  (  •  I  Y  )  * 
is  called  the 
This  result  is 


related  to  the  Radon-Nikodym  Theorem.  Indeed,  if  now  X  =  , 


r  .  |R q  rD 

Y  =  3?  with  f  e  SBV  U  [0,1]  ,  g  6  53V  U  [0,1]  ,  consider 

u  f 

P  ^  :  -»  [0,1]  and  p^  :  -*  [0,1]  ,  the  associated  Lebesgue- 

Stieltjes  measures.  In  particular,  one  could  have  here 
g  =  pro jy( $or ) ( f )  (changing  notation)  since 


g  ( Y )  =  Pro  j  Y  (  4>or  )  (f  )  (y)  =  $or  (  f  (x ,  y )) 

all  xe  X 

2  max  f(x,y) 
all  xeX 

2  f(x,y)  ,  all  x  e  X  ,  y  e  y 
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r 

Assuming  all  f(- |y)  e  SBV^  n  [0,1]  ,  y  e  R  and  g  e  SBVr  n 

Rr 

[0,1]  ,  then  :  Bq  -»  [0,1]  and  :  Br  [0,1]  exist. 

Thus,  for  all  x  e  R1^  ,  y  €  Rr 


("f(-|Y>,  ^)(raYq(X>  X  rayr‘y>> 
=  (fy(-  Iy)^  g)  (x,y) 

=  <VfY(xlY)<f  ,g(Y)) 

=  f(x,y) 

=  ^f(rayg(x)  x  rayr(Y))  , 
and  hence  for  all  g  :  R^+r  -»  R  measurable, 

J  h(x,y)  dfjf(x,y)  =  J  h(x,y)d{pf  > 


x ,  ye  R 


J  (.  |  y)  <f  “  g 

_q+r  Y  1  *  'd).  r&  a 
x,yeRM  T& 


AO  (*.Y)  • 


If  in  particular  is  absolutely  continuous  w.r.t 

V  }  '  ie- ' 

Pf(B  x  C)  S  f'g(O)  ;  all  B  e  ,  C  e  Er  ,  then  by  the  Radon-Nikodym 
Theorem,  up  to  p  -measure  zero, 

g 


diUf  (x,y) 


d"f<x'y)  '  '  Sir iv) 


d^g(y)  . 


noting  that  the  conditional  -  Radon-Nikodym  derivative 


dpf (x,y) 

dpd(y) 


can  be  identified  with  p 


when  if  =  prod . 


For  example,  if  f  is  a  c.d.f.  with  g  =  pro  jy  ( (f  )  ( f )  ,  then 

g  and  f  will  satisfy  the  above  conditions. 


Bayes 1  theorea 


Fix  a  truth  functional  system  7  =  (A  ,  <t>  ,  A  )  . 

Tnot  TSt  Yor 

Let,  for  all  x  e  X  ,  f(-|x)  :  Rr  -  [0,1]  in  SBVr  be  given, 

using  conditioning  notation,  interpreted  as  the  conditional  data 

(Y)  g.d.f  .  and  g  :  Rg  -*  [0,1]  in  SBV^  given,  interpreted  as  the 

prior  parameter  (x)  g.d.f.  Then  as  a  converse  to  the  previous 

development  form  joint  parameter  and  data  (X,y)  g.d.f.  , 
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f  :  Rq+r  -#  [0,1]  ,  where  for  all  x  e  Rq  ,  y  e  (Rr 

f(x,yl  =  4>& (gU,  ,  f(y|x))  . 

Under  reasonable  conditions  (see,  e.g.,  Theorem  12), 

f  :  Rq+r  -  [0,1]  e  SBV  . 

r 

Analogous  to  the  previous  result,  if  f  ( -  j  x )  is  normable  for 
all  x  e  x  and  <f>&  is  right  distributive  over  <$>Qr  then  projec¬ 
tion  consistency  holds  for  g  w.r.t.  f  .  Call  this  condition  star 
(*)  . 

J* 

In  any  case,  determine  proj^C^^)  ( f )  :  R  -*  [0,1]  ,  the 

averaged  data  g.d.f.  and  finally  f(-  (y)  tRq  -*  [0,1]  ,  the 

poster i or  g.d.f.  implicitly  via  the  relation 

$&(g(x),  f  ( y  |  x)  )  =  f  ( x ,  y )  =  <j>&(f(x|y),  pro  jy(  <t>or )  (  f )  )  ,  for  all 

a  r 

X  €  ,  y  €  R  . 

As  usual,  these  relations  may  be  lifted  to  the  corresponding 
Lebesgue-Stielt jes  measure  level  over  the  appropriate  Borel 
o-algebras . 

Under  the  condition  (*)  ,  assumptions  for  ,  $  and 

g.d.f. 's  involved,  essentially  all  (implicit  form)  analogues  of 
conditional  Probability  Logic  calculus  extend  to  the  situation  here. 
For  example,  using  abridged  notation,  the  following  hold: 

For  all  variables  x  ,  y  ,  z  ,  and  f  :  Rp'rq',’r  -*  [0,1]  ,  SBV 
g.c.d. : 

(1)  f  (x,z,y)  =  <|>&(f  (x,z|y)  ,f  (y)  )  . 

Fixing  y  , 

(2)  f  ( x ,  z  |  y )  =  <f>&(f  (  (x|z)  |  y )  ,  f(z|y))  . 

Substituting  (2)  into  (1))  , 

(3)  f(x,z,y)  =  $&(f ( (x)z) |y) ,  f(z|y),f(y)) 

Projecting  out  z  (using  condition  (*)  yields 
f  (x,y)  =  4>&(f  (z|y)  ,f(y)  )  , 

so  that  f(z|y)  is  consistently  defined,  and  hence  (3)  becomes 
f  ( x ,  z  ,y )  =  <$>&(f  (  (x|z)  |y)  ,  f(z,y))  , 


whence  we  may  identify 

f ( (x|z) | y )  =  f (x|z,y)  . 
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Also,  we  may  always  identify 
f (x,z|y)  =  f ( (x|y) . (z|y) )  . 

Thus,  eq.(2)  becomes 

(4)  f  (  (x|y)  ,  (z|y)  )  =  4>&(f(x|z,y)  ,  f  ( z  |  y ) ) 

=  4>&(f  (  ( x  |  y )  |  ( z  |  y )  )  ,  f  ( z  |  y )  )  . 

Hence,  we  may  identify, 
f (x|z,y)  =  f ( ( x 1 y ) | ( z | y ) )  , 

the  analogue  of  the  well-known  classical  Bayesian  sequential 
"learning"  equation. 

Again,  the  Lebesgue-Stielt jes  measure  extensions  of  these  re¬ 
sults  should  be  mentioned. 

Quantifications  carry  over  to  this  example  as  in  the  general 
situation  with  little  change  in  forms.  (See  below.) 

It  follows  from  the  previous  results,  given  any  truth  func¬ 
tional  copula  system  5  =  (4*Ilot»  ,  $or)  '  resulting  Probability 

Logic  (PL)  may  be  identified  with  the  class  DPL^  of  all  well-formed 

strings  over  all  c.d.fs  '  ^•e-'  a^ 

comb(C,x  ,t  ) (f, , . . . ,f  ) 
for 

extending  uniquely  to  the  class  PL?  of  all  well  formed  strings 

over  probability  measures  pf  ,  i.e.,  all 

rl  n 

comb ( C , x  ,  t  )  (pf  )  •  which  can  be  further  evaluated 

9&  9or  1  n 

using  previous  results. 


Let  Z  be  a  given  space  called  the  base  population.  Let  X 
and  Y  be  two  other  spaces  called  the  X-attribute  and  Y-attribute 
spaces,  and  form  X  x  Y  the  Joint  attribute  space. 

Let  h^  :  7  -*  X  an^  hy  :  Z  -*  Y  be  measurement  functions. 

Thus,  e.g.,  Z  =  set  of  all  Australians  living  in  June,  1933, 
h^  =  ht  (in  feet),  hy  =  wt  (in  lbs.),  X  =  [0,10]  (in  feet), 

Y  =  [0,2000]  ( in  lbs) . 

We  wish  to  symbolize  for  two  particular  attributes  B  e  s(X)  , 
C  e  *(Y) 

ii  <quanti f ication  (for  Z  having  B  given  C)>n  . 

Let  0(Z)  represent  the  class  of  all  discrete  probability 
functions  over  Z  (including  deficient  ones). 

Let  A  e  s(X  x  Y)  represent  a  binary  compound  attribute  of 
X  x  Y  and  consider  projy(A)  e  $(Y)  . 
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Define  also,  for  some  fixed  v;eighting-function  p  e  q(Z)  , 
corresponding  to  $  1(p)  «  7(Z)  , 

C .  ( A )  =  t  ( (hy,hv)_1(A)n  4>— 1  ( p ) ) 

1  zeZ  X  Y 

C_ ( A)  =  t  ( hv~ 1 (projv(A) )  n  t_1 (p) )  . 
z«Z 

Since  C„(A)  can  be  written 

C„  ( A)  =  t  ((h  ,h  )_1(X  x  pro  jv(  A)  )  n  ^(p)) 


<P  (s,y)  =  x  r  <p  )  (s,  (x)  _y) 


XxprojY(A) 


rX  xeX  A 


*  (t  6a 


*  max(<p . (s,x)  ) 
xe  X 

1  <p.(s,y)  ;  all  s  e  X  ,  y  e  y 


'  -1  <z1>z2) 
(hx  ,hy)  1 (Xxpro jy ( A) ) 


Xxpro jY( A) ) 


(  ^  )  '  hy  (  Z  2  )  ) 


z  ♦A(hx(z1).hy(z2)) 

=  4>  _i  (zrz2)  • 

(hx,hy)  1 ( A ) 


Assuming  n  $  (p)  and  t  (•)  preserve  the  above  inequality, 

Z€  Z 


it  follows  that  $c  ^  5  <pc  pointwise. 

Hence,  (C1(A)  ]  C2(A))  exists,  where 


<M  u,v)  =  <t>&(<}> 


(u)  ,  <p  (v))  ail  u, v  e  [0,1]  . 


C:(A) 


( C ^ { A)  | C  ^ ( A ) )  C2(A) 


Now  the  class  of  all  quantifiers  such  as  "most",  "some',  "all", 
"at  least  on;",  "none",  "about  3/4",  "sometimes",  "6/7",  etc,  may  be 
identified  with  7(H)  ,  and  for  any  qu3nt  «  7(H)  with  $crUante  1 

and  any  B  e  7(W)  ,  it  is  assumed  quant(B)  €  7(W)  also  with 


^quant(B)  ~  ^quant  °  ^B 
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Thus,  letting  z  =  (z,z)zeZ  ,  form 
ll  <z  e  quant(C1(A)  |  C2  ( A )  )  )  il 

=  ^quant ( ^ ( C ^ (A) ) } C2 ( A) ){z))  ' 

=  il  (quant  of  z  (population)  w.r.t  C„(A)  have  c.(A))ii  . 

<C  J. 

To  indicate  the  restriction  of  all  c.d.f.'s  to  a  fixed  , 

e.g.,  we  use  the  notation  DPL 

"  ,  <3 

Lebesgue-Stielt jes  Logic  (LSL)  includes  Probability  Logic  as  a 
special  case  and  centers  about  the  development  of  the  class  DLSL? 

of  all  well-formed  strings  over  all  g.d.f.'s  f^,...,^  ,  i.e.,  all 

comb(C,x  ,t  )(f1...,fn)  , 

^or 

which  induces  a  corresponding  class  LSL^  of  Lebesgue-Stielt jes 


measures 


comb(C,x  ,t  )(f  . f  ) 

T&  Tor 

=  comb(c  ,x  ,  t  )(//f  ) 

?or  rl  n 


for  a  large  class  of  such  g.d.f.'s.  Furthermore,  the  restriction  of 
fl,-’*'fn  to  be  also  SBV  an<3/or  to  have  ranges  in  [0,1]  ,  carries 

over  to  the  corresponding  measures.  (For  example.  Corollary  2 
(Appendix)  states  that  f1(...,fn  g.d.f.  SBV  implies  . /jf 


1  n 

are  finite . ) . 

Note  that  when  ^i''**'^n  are  restricted  to  have  ranges  in 

[0,1]  ,  then  any  string  will  also  have  range  in  [0,1]  ,  for  either 

Probability  Logic  or  Lebesgue-Stielt jes  Logic.  In  either  case, 
using  the  fundamental  membership  function  $  ,  we  can  bijectively 

assign  attribute  A ^  to  fj  : 


A,  =  <P  1(f.)  €  7(1R  j)  _  (f  •  ir  J  -  [0,1])  , 


so  that 

fj  "  ♦a.  ' 

J 

J  =  1.  • 

comb ( C , x .  , 

)(A1 

Tor 

by 

q  •  -t-q 


comb(C,x  ,r  )  (A  . A  ) 

&  Tor 

comb(n  .x  ,t  )  (fl . r  )  , 

&  Tor 
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with  truth  evaluation,  for  any  y^  «  IR  ,  j  =  l,...,n  , 

ne  comb(C,xA  ,t  ) (A . A  )>u 

^comb(C,x  ,t  ) ( A  , . . . , A  ) ^ yl ’ ‘ ‘ ' yn ^  € 

"or  1  "* 

A  similar  procedure  may  be  carried  out  at  the  Borel  o-algebra 

level.  Thus,  for  any  B.  e  b  ,  i  =  ,  with 

J  <3  j  In 

c.d.f.’s,  now  letting  attribute  ji  .  correspond  uniquely  to  p  : 

J  T 

J 

A.  =  4>"1(pf  )  e  »(B  )  ~  (p  :  B  -  [0,1])  ,  j  =  1 . n  , 

J  j  qj  j  qj 

»<(B.)._ 1  __  n  e  comb (  C  ,  x  ,  ,t  )  (^ - <An)>" 

J  J  I  •  •  •  I  '  &  T  QJ. 

^combfC.x  ,t  )(a  ji  )(B1' - Bn) 

"or 

=  comb  (  C  ,  x  .  ,t  )  (pf  ,  .  .  .  ,pf  )(B1,...,Bn)  e  [0,1]  . 

”or  1  n 

This  symbolization  allows  a  natural  setting  in  extending  clas¬ 
sical  set  relations  and  definitions  to  attribute  forms.  Definitions 
for  generalized  unions  U  ,  intersections  n  ,  subset  relations  £  , 

incidence  relations  A  and  functional  transforms  t  may  all  be 
motivated  by  this  structure,  even  though  the  logics  involved  are 

strictly  not  truth  functional.  The  latter  condition  only  limits  the 

full  evaluatoin  of  x  and  t  in  terms  of  marginal  components. 
Instead,  evaluation  of  all  relevant  joint  c.d.f.'s  or  measures  must 
be  determined. 


Discrete  spaces 


If  in  all  of  the  previous  development,  either  iRq's  are  re¬ 
placed  everywhere  by  discrete  subspaces  or  all  relevant  functions 
are  restricted  to  be  purely  step  functions,  then  the  results  spe¬ 
cialize  appropriately.  PL  is  thus  not  only  generated  by  strings 
over  (the  prime  from  here  on  will  indicate  appropriate 


restriction  to  a  discrete  subspace  XV4; 
equivalently  by  strings  over  0^  ,  where 


of  IR4  ,  etc.)  but  also, 


h  is  an  ordinary  or  deficient  probability 

(a i  v ( 5 ) 

function  over  x'4  ,  i.e.,  h  e  [o,l] 

r  h (x )  <  1)  . 

ve  Y  ^  *2  ) 


O'  =  (Pj((-))  |  f  <=  D '  (including  deficient  c.d.f.'s)),e 


Thus , 
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Attribute-events  (This  extends  Zadeh's  concept  of  fuzzy  events 

f  285  j  .  ) 


For  any  f  e  bv^  n  [0,1]  ,  and  B  «  B^  ,  as  we  have  seen 


pf(B)  =  j  <$>B(x)df(X)  is  the  yjf-measure  of  B  . 
xeRq 

In  turn,  for  any  attribute  A  e  y  (  ir°  )  ..1th  <t>  e  if  ,  we  may 

A  q 

define 


Pf( A)  =  J  $A(x)df(x)  . 

xeRq 

Thus,  the  first  extension  of  BV^  n  [0,1]  and  its  class  of 

strings  DLSL?  over  points  in  Ry  ,  to  LSL?  over  Borel  sets  in* 

B  may  be  once  more  extended  in  its  domain  to  "Borel"  attributes  as 

q 

above . 

Also  note  that  if  B  e  is  a  finite  set,  then  induces 

D 

which  is  equivalent  to  0  . 
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»ndlx  to  2.3.9.B  Functions  of  Bounded  Variation  in  R  ■ 


The  partial  order  s  over  R4  is  defined  by,  for  all  y  = 


<=  Ry  ,  x  s  y  iff  Xj  s  yj  ,  for  j  =  1 . q 


For  any  x,y  e  R^  such  that  x  S  y  ,  define  the  corresponding 
closed  q-interval  J(x,y)  by 

J(x,y)  =  {v  j  v  «  R^  &  x  s  vs  y} 


noting 


=  x  [x  . , yi ]  , 
j  =  l  J  J 


J(x,y)  =  [x,y]  ,  for  x,y  «  R 


and  for  x  s  y  , 


J(x-lg  ,  y-lq)  =  [x.yjx • • • x [x,y]  =  [x,y] 


Also,  define  for  any  x  = 


e  Rq  , 


d 

x  a  y  = 


xlAyl 


VYq 


x  v  y  = 


xivy  i 


x  vy 
q  *q 


x j  a  y j  =  min(Xj,y^)  '  xj  v  Yj  "  max(x^.,yj)  ;  j  =  1,2 . q  . 


3(x,y)  =  J  ( XAy ,  xvy )  . 


Hence 


3(0  ,x)  =  J ( o  ,x)  ,  if  x  i.  o 

~q  '  ~q  -q 

3(0  ,x)  =  J(x,0  )  ,  if  xso  . 
~q  ~q  ~q 


For  f  :  R  -*  R  ,  define  for  any  x  S  y  e  r  ,  the  first  di  f- 
Cerunce 


d, ( f , J(x,y) ) 


f  (y) 


f  (x)  . 
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For  f  •.  R  -*  R  define  for  any 
x. 


X  = 


X, 


s  y  = 


Y  i  1  2 


1 

y2  J 


6  R 


the  second  difference 


a£(  f ,  J(x,y)  )  =  -d1(^1(f  ( ‘  ,x2)  ,  J(x1<y1))  ,  J(x2>y2)) 

*  <*2)  -  f(y1«x2)  -  f(x1.y2)  +  f(y1,y2) 

In  general,  for  f  :  Rq  -»  R  ,  and 


X  = 

■  X1  ’ 

<  y  = 

'  yi  ' 

e  Rq  ,  the  q^  difference 

X 

y 

q 

q 

4 

q 


(f , J(x,y) )  =  J , 


q 

-  z 
j=0 


<Vl(f(-'Xq}'  J< 


X 

'  yl 

: 

x  , 
q-1 

.  yq-l  . 

))  ,  J(xq,yq)» 


1  f(X!  +  V  <yi"Xl> . Xq  +  9 q  <  W>  • 

- ®  g€  (  0  <  1  )  <  1 

0  .  +  .  .  .  +0  =*  1 

i  q  J 


Let  J(xi(yj)  ,  j  =  1 ,  .  .  .  ,m  ,  be  m  disjoint  or  contiguous 

d  m 

closed  q-intervals  and  region  K  =  u  J(x.,y.)  .  Let  Q  be  any 

i  =  l  J  ^ 

partitioning  of  K  ,  where  each  J(x^.,y^)  Is  partitioned  into  dis¬ 
joint  or  contiguous  q-sub-intervals .  Define  the  variation  of  f 
over  K  v  ,r  .  t  .  Q  as 


Vr(f ,K,Q)  =  ]>  I  Aq{f ) I 

J(u, v)eQ 


and  the  total  variation  of  f  over  K  as 

7r(f,K)  =  sup  Vr(f,K,0}  . 

(all  a  over  K) 

q+  o 

The  total  variation  of  f  over  IR  and  are  defined, 

respectively,  by 

Vr(f,R  )  =  lim  Vr ( f , J( 0  ,t-l  )) 
t-.-K»  ~q  *'q 
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Vr(f,Rq)  =  lim  Vr ( f , J ( -t ■ 1  ,  t • 1)  )  , 

q  q 

etc. 

f  :  Rq  IR  is  of  bounded  variation  over  region  K  S  Rq  iff, 
by  definition, 

0  S  Vr(f,K)  <  . 

f  ;  Rq  -»  R  is  of  bounded  variation  (BV)  iff  for  all  bounded 
K  S  Rq  , 

Vr(f,K)  <  +®  . 

f  Rq  R  is  of  strong  bounded  variation  (SBV)  iff  f  is  of 
bounded  variation  over  Rq  ,  i.e., 

Vr  (  f  ,  Rq)  <  +«®  . 

f  Rq  -♦  IR  is  nondecreasing  ( ND )  iff  for  all  x  S  y  e  R^  , 
f(x)  s  f(y)  .  It  follows  that  f  is  nondecreasing  iff  f  is 
nondecreasing  in  each  argument  separately, 

f  :  R^  -»  r  is  positive  increment  monotone  (PM)  iff,  for 

x  £  y  e  Rq  ,  a  (f,J(x,y))  2  0  . 

Clearly,  if  q  =  1  ,  f  :  R  -»  »  is  ND  iff  f  is  PM 

However,  for  q  5  2  ,  neither  property  in  general  implies  the  other. 
For  example: 

o 

If  f , g  ;  R  -»  R  are  arbitrary,  then  h  :  R  -*  R  defined  by, 
for  all  x,y  e  r  , 

h(x,y)  =  f ( x )  +  g(y)  , 


is  such  that  for  all  x 


d 2 ( h , J (x , y ) )  s  0  , 

yet  f  and  g  may  be  chosen  each,  e.g.,  decreasing  in  x  and  y 
separately  and  hence  h  is  not  ND  but  rather  decreasing. 

2 

On  the  other  hand,  f  R  -»  R  given  by,  for  all  x,y  e  R  , 
f(x,y)  =  max (0,min(l,l-(l-x) ( 1 —  y ) ) )  , 
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X.  y 

is  ND  ,  but  for  any  x=  <  y  =  * 

X2  [  y2  , 


2  (  f  .  J  (x,  y )  )  =  -(Yj-Xj)-  (Y2-x2)  . 

Let  f  ■.  Rq  -  ir  ,  c  U  D  =  {1,2,.  .  ,  q }  , 

Wlth  C  =  <jl . V  '  1  S  Ji  <-,‘<4  *  <J  '  1  ^  m  S  q  , 

°  =  (kl . kq-m}  '  1  5  kl<'''<kq-m  S  3  and  Y  =  Tkj.  €  iRq_m  . 

yk 

q-m 

Then  the  ( C ,  y )  -marg  i  na  1  function  f  :  iRm  -  (R  Is  defined  by,  for 

X  J 

all  x  =  :  1  €  Rm  , 


fc  v(x>  =  f (x,y) )  , 


where 


1  [  «j  •  «  I  ■  i,  .  1  !  .  <  ■ 

7 (x,y)  =  2  2  ;  zi  =  1 

:  ^kt  '  i  =  ,  1  <  t  S  q~m  . 


If  also,  sup  (fc  )  <  +®  ,  occurrinj  at  some 
y  e  Rq_m 


Yo  €  R  u  {+<0-1„_™>  -  then  f„  =  f 


c  ( c «  yd ) 


_m  q 

R  -•  R  is  called  the 


C  marginal  function  of  f  . 

Let  f  iRq  if  .  Define  f  to  be  hereditary  Ph  irr 


for  all 


0  *  c  s  q}  ,  and  all  x  e  Rm  ,  f  ;  Rcard(C)  _  R  is  pM 

c ,  X 


On  the  other  hand,  define  f  to  be  grounded  iff  f 


or  any 


«  Rh  ,  f { x )  =  0  ,  if  there  is  at  1 


east  one  vike 


x^  ;  1  s  i  5  q  such  that 


—  CO 
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Lemma  1 . 

Let  f  :  lR<^  -*  R  . 

(i)  If  f  is  PM  and  grounded,  then  f  is  hereditary  PM. 

(ii)  Suppose  f  is  PM.  Then  if  either  f  is  hereditary  PM  or 

grounded,  f  also  ND.  In  the  case  of  f  being  grounded,  ND,  in 

turn,  implies  .hat  f  >  0  . 

Prop  f_: 

(i)  For  example,  for  q  -  2  ,  for  a^ ( f , J ( x , y ) ) , 


r  yi  i 

'  Yi  ' 

X  =  1  y  *  j 

t  “2  j 

.  Y2  . 

letting  x2  -♦ 


yields  0  <  f(1)/y2(y1)  -  f{1),y2(y2)  =  f(y1(y2)  -  f(xry2)  -  o  +  o  . 


a  a 


From  the  definition  of  A  ,  for  x  S  y  e  if? 

q 


f(y)  -  fix)  = 


K  A 

c  ard(C) 

0  S  C  5  {1 . q)  , 

j 

D  -  (1 - ,  q }  -•  C 


Cfc,(xio'^x|D)'  fY|D)i>  - 


(x|C)  indicating  the  restriction  of  x  to  C 
Thus,  hereditary  PM  implies  ND  . 

If  f  is  grounded,  let 


etc . 


x  = 


'  xi  ' 

—CD 

xj-i 

Xj 

H 

> 

V) 

Yi 

— CD 

xj  +  i 

—CO 

X 

q 

j  =  1,2, 


Then,  takir.g  limits 


0  <  uq(f,J(x,y)>  =  f( 


■  xi 

Vi 

Xi-1 

y  j 

)  -  f( 

xi 

xj+i 

x  j+ 1 

.  Xq 

.  xq 

j  *  1,2, . 


implying  f  is  ND  . 


For  any  f 


R  define  the  total  variation  function 
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r  (  f )  :  Rq  R  +  by,  for  all  x  = 

d  ,  ..Vx) 


X 


€  R 


r(f)(x)  =  (-1) 


Vr{f ,J(0  ,x) )  , 


where 


v  (x)  =  card  {j  j  1  £  j  S  q  &  x.  <  0} 

'“d  %J 


Le— a  2 

Let  f  :  Rq  -*  R  . 

(1)  If  f  is  PM  then  f  is  BV  with  for  all  x  s  y  e  Rq  , 

Vr ( f , J ( x , y ) )  =  Vr(r ( f ) , J(x,y) ) 

=  dq(f , J(x,y) ) 

=  (f ) , J(x, J) )  . 

(ii)  If  f  is  BV  ,  then  for  all  x  £  y  e  Rq  , 

Vr(f , J(x,y) )  =  Vr (r (f) ,J(x,y) ) 

=  (f ) , J(x,y) )  , 

>  Mq(f  ,J(x,y)  )  |  , 

and  hence  r  ( f )  is  PM  .  In  addition,  Vr(f,J(x,y))  as  a  function 
of  intervals  J(x,y)  s  Rq  is  finitely  additive. 

(iii)  If  f  is  PM  and  grounded,  then  for  all  x  e  Rq  , 

Vr  (  f  ,  J  (  -®lq  ,x)  }  =  dg(f ,J(-»lq,x) )  =  f  (x)  . 

Theorea  1  (McShane,  [174],  p.  250) 

( I )  Let  f  :  Rq  r  be  BV  .  Then  there  exist 

(f)  +  ,(f)~  :  Rq  R  ,  PM  such  that 

f  =  (f)+  -  ( f ) “ 

is  a  minimal  decomposition,  i.e.,  for  all  such  possible  represen¬ 
tations,  (f)+  and  (f)  have  minimal  total  variation.  (f)+  and 
(f)  may  be  defined  by 
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(f)+  =  |(r(f)+f)  ;  (f)  =  I(r (f)-f)  , 

Thus  , 

r(f)  =  (f)  +  +  (f)~  . 

Also,  tor  all  x  S  y  €  Rq  , 

-*»  >  Vr  (  f  ,  J  ( x,  y ) )  =  Vr(r  (f )  ,  J(x.y)  ) 

=  Vr( (f)  +  ,J(x,y) )  +  Vr { ( f f  ,  -  J(x,y)) 

=  dg((f)+,j(x,y)}  +  4  ((f)“,J(x,y>)  =  4  (t (f) ,J(x,y) )  . 

If  also  (f)+,(f)~  are  grounded  -  and  hence  f  is  grounded 
then  for  all  y  e  Rq  , 

d  (f ,  J(— 1  .y) )  =  A  (  (f )  +  ,  J<-®1  y)  )  -  A  ( (f)",J(-«i  y)  ) 

St  Vi  ,1  Vd  ‘■d 

=  (f)+(y)  -  (f)"(y)  =  f(y)  • 

(ii)  Conversely,  if  f  ,f  :  R<3  -»  R  are  arbitrary  and  PM  then 
f  =  f  -  f  ;  Rq  -  R  is  BV  . 

Remarks . 

1.  The  identity  function  id  :  R  -*  R  is  PM  with 

Vr  (  f  ,  J(  ~t ,  t ) )  =  2t  ,  for  all  t  >  0  . 

2.  Let  Q  =  {J(Xj,Yj)  |  j  =  l,...,m)  be  any  collection  of 

disjoint  or  contiguous  closed  q-intervals  and  a  ,,.,a  any 

l  m 

m 

constants.  Then  the  step  function  f  =  2  A  .  is  of  BV 

j-1  J  J|Vyj’ 

with 

m 

Vr(f , [~t,t]q)  =  z  | A  |  •  2q  . 

j  =  l  J 

3.  For  any  f  R  -*  R  ,  if  f  is  PM  or  NM  (non-increas¬ 
ing),  or  a  finite  sum  of  bounded  unimodal  or  multimodal  (finite 

number  of  modes)  functions,  then  f  is  BV  . 

Lemma  3 

Let  f  :  Rq  -*  R  be  PM  (and  hence  BV)  and  grounded  (and 
hence  by  Lemma  1,  ND ) .  Then 

f  is  bounded  iff  f(+®l  )  <  +®  iff  f  is  of  SBV  . 

***  Q 
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Proof : 

If  f  is  ND  and  bounded,  it  follows  that  f{+®lq)  <  . 

If  f(-H»lg)  <  -t-®  ,  then  for  all  t  >  0 

Vr{f , [-t,+»]q)  =  , J(-t- lq,+~lg) ) 

q 

5  J  5  £l+-iq> 

0X . 9q  e  (0,1) 

V-'+®q  =  * 


q 

s  'y  •  [  q  ] - f ( +<*> i  )  =  2q-f(+®i  )  <  +®  . 

j=o  [  j  J  ~q  ~q 


Taking  limits  as  t  -♦  +®  ,  yields  f  is  SBV  . 
Finally,  if  f  is  SBV  ,  by  evaluating  for  t  >  0 

q-1  u 

>  Vr  (  f  ,  [ -t , +® ]q)  =  f(+<»l  )  +  z  (-l)q  •'•g.(t)  , 

“q  j=o  3 


each  g  j  ( t ) 


is  a  sum  of 


terms,  each  of  which  contains  at 


least  one  term  of  the  form  f  ( •  •  •  , -t ,  •  •  •  )  ,  which  by  the  grounding 

property  of  f  ,  for  t  sufficiently  large,  is  arbitrarily  small. 


Hence , 


f(+”lq)  <  • 


Theorem  2  (McShane  [174],  pp.  125,  230)  -  Basic  approximation  I. 

(i)  Let  f  :  Rq  -«  R  be  bounded  and  (Borel)  measurable.  Then,  for 

each  t  >  0  ,  f  is  thus  ( Lebesgue )  integrable  over  [-t,t]q  and 
hence  there  exists  a  sequence  of  step  functions  and  hence  strong  BV 
functions  such  that 


=  2 
j-i 


A  t , n, j ’ (x 


t ,n, j ' yt ,n, j ' 


(ii)  Let  f  :  Rq  is  be  integrable  over  Rq  .  Then  there  is  a 
sequence  of  step  functions  -  and  hence  strong  BV  functions. 
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j  11 

:  S  ^  a 

n  Z  nj  T 


j  J{xnj'Ynj} 


in  R 


n  =  1,2,3...,  such  that 


lim  J  |  f(x)  -  fn(x)  |  dx  =  0  . 


(iii)  Let  f  :  r4  R  be  such  that  there  is  an  t  >0  such  that 


(locally)  over  R4  ^  [-t  ,tQ]M  ,  f  is  of  BV 


For  example,  f 


may  be  PM  or  NM  over  R4  -i  [-t  ,t  ] 
1  1  o  oJ 


Suppose  over  £  —  t  , t  j 


f  is  bounded  and  measurable  or  integrable.  Then  there  exists  a 

sequence  (f  )  ,  .  .  of  BV  functions  f  :  Rq  -»  R  such  that 

4  n' n=l , 2 , . 


of  BV  functions  f 


R  such  that 


I  I  f(*>  -  fn(*) 


dx  =  0  . 


Proof :  For  example,  if 


is  bounded  and  measurable  over 


[-t  , t  ]q  ,  and  f  is  BV  over  Rq  ^  [-t  ,t  ]q,  define 
o  o  ■  o  o 


fn  "  ♦ 


f^.  _  +  $  „  • 

>Pq4[-t0,to]q 


f  ;  n  =  1 , 2 . 


Remark :  Result  (iii)  above  can  be  modified  for  f  assumed  to  be 

SBV ,  whence  (f  )  .  „  becomes  a  sequence  of  SEV  functions. 

Let  f  :  Rq  -♦  R  be  PM  .  Recall  that 
Vr(f,J(x,y))  =  <4  ( f , J (x, y) )  ,  as  a  function  of  closed  intervals 

J(x,y)  s  Rq  is  a  finitely  additive  non-negative  interval  function. 
Then  from  standard  measure  theory  constructions  (analogous,  e.g.,  to 
Lebesgue  measure  theory),  a  countably  additive  non-negative  measure 

extending  A(f ,•)  is  induced  over  y(/^,Rq),  the  c/ass  or  alt 


^  ^-measurabl e  subsets  of 


,  a  o-algebra.  In  addition,  let 


be  the  class  or  all  \i  ^  -me  a  surab I e  tunctions  g 

g~ 1 ( B )  €  y(pf,Rq)  for  all  Be  and  L1(pf,B)  S  *(pf, B)) 

the  class  of  all  /j  -/ nt  egrab  I  e  runctlon  s  over  B  . 


to  be 


Theorem  3  (McShane  [174],  pp.  262-263,  extended) 


Let  f  :  Rq  R  be  PM  (and  hence  BV)  .  Then 
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(i)  ®g  s  y(^f,(Rq) 

and 

(ii)  <Fq  s  *(pf,Rq) 


(since  for  any 
g-1(B)  e  y(pg,R 

B  €  B 

g))  • 

,  and  g  e 

F 

q 

g_1 

(B)  e  and  hence 

(ill) 

If 

n 

g  =  r 
j  =  l 

Aj  ♦a.  ' 

1 

A  ^  • 

.  .  ,  A  € 

F 

q 

with  A  ,  .  , 

r 

’  ,An 

real  constants, 

r 

then 

n 

J 

x  e  Rq 

.g(x)df(x)  = 

X 

J  g ( x ) dp 

e  Rq 

f  (x) 

II 

(-1. 

II  M 
>-» 

Aj‘  J  <t>A  (x)dpf(x) 

x  e  Rq  J 

J  <t>  (x)dp  (x)  =  f  dp  ( x )  =  p  ( A.)  , 

q  j  .  f  J 

X€Aj 

etc . 

.Theorem  4  (Dominated  convergence,  McShane  [174],  p.  276) 

Let  f  :  Rq  -  K  be  PM  .  Let  g  :  Rq  _  r  be  such  that 
g  €  2(fjf,!Rq)  and  g  is  bounded  by,  say,  Mq  <  +®  .  (This  is 
guaranted  if  9  «  Fq  and  g  is  bounded  by  Mq  .  )  Then,  for  any 
J(u,v)  €  Rq  and 

•f  g(x)df  (x)  <  X  | g (x)  | df  (x) 

x  e  J  ( u ,  v )  x  e  J  ( u ,  v ) 

s  M  •  X  d  f  ( x ) 

X  €  J(u,v) 

=  Mo-pf ( J(u,v) ) 

=  Mo-Vr(f ,J(U,V) ) 

<  , 

9  e  (pf , J(u,v) )  . 


and  hence 
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Theorem  5  (McShane  [174],  p.  287) 

Given  f  :  Rq  -♦  R  BV  ,  let  for  any  t  >  0  , 
f  =  (f)+-  (f)~ 

be  any  minimal  decomposition  of  f  into  (f)  +  and  (f)  (PM)  . 

Then  f  induces  a  complete  regular  countably  additive,  signed, 
measure  which  extends  interval  function 

4  ( f i • )  =^q((f)+,-)  -4  ((f)-,-)  =Vr((f)+,-)  -  Vr ( ( f ) ~ , • )  ,  such 

that 

B  s  y(p  ,Rq)  =  , Rq )  n  y(p  ,Rq)  , 

q  r  (f)  (f)' 

and 

r  s  sf  (/j  ,Rq)  =  ,  Rq )  n  ff(fj  ,Rq)  , 

q  r  (f)  (f) 

and  for  any  B  e  y(/L/f,  Rq) 

L1(/i-,B)  =  L1^  ,  , B )  n  LX(/u  , B )  , 

C  (f)  (f) 

Also,  for  any  g  e  L1(A/f,B)  , 

J  g(x)d/jf(x)  =  J  g(x)df{x)  =  |  g(x)d(f)  +  (x)  -  J  g(x)d(f)~(x) 

xeB  xeB  x«B  xeB 

is  consistently  defined,  independent  of  the  particular  (minimal) 
decomposition. 


Remark : 

If  in  Theorem  5,  c  is  any  constant,  (f)+  may  be  replaced  by 
(f)+  +  c  and  (f)~  by  (f)  +  c  ,  with  no  other  changes. 


Theorem  6.  (McShane  [174],  pp.  291-292) 


Let  f 

:  Rq  - 

R  be  PM  . 

Then  there  exists 

?  :  Rq  R 

which 

is  PM  , 

right 

continuous 

and  grounded  and 

hence  ND 

and 

non-negative 

,  such 

that 
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(1)  y(/i.,Rq)  £  3*  l/J.  ,Kq)  , 

r  f 


and  if  Be  y(pf,Rq)  with  M  f  ( B  ^  <  +oB  then  pf(  B)  =  (B)  . 

( i  i )  For  any  J(x,y)  e  Bo  s  y(pf,iRq)  , 


Pf ( J(x,y)  ) 


(J(x,y) ) 


dq(f . J(x,y) ) 


etc . 


Also,  as  in  Theorem  1, 

P  ( J(-«l_,y) )  =  a  (?, J('«l  ,y))  =  ?(y)  , 

a  ~t3  q  ~q 


(iii) 

S(pf,Rq)  £ 

3(p-.,u?q)  , 

f 

( iv) 

L^p-.B)  ; 

f 

all  B  « 

:f(pf,Rq)  and  if 

3  €  L1(pfl 

B)  ,  then 

J  g ( x) df ( x) 

=  J  g  ( x )  d?  ( 

x)  . 

xeb 

xeB 

Call 

any  such 

?  as  in 

Theorem 

6,  the  equivalent 

generalized  distribution  function  ( 

g.d.f.  ) 

w.r.t.  f  . 

Corollary  1 


Let  f  :  Rq  -•  R  be  BV  with  minimal  decomposition 

f  -  (f)+  -  (f)  .  It  follows  immediately  from  Theorem  6  that 

extending  the  above  definition,  there  is  an  equivalent  generalized 

distribution  function  ?  w.r.t.  f  ,  i.e.,  ?  =  (?)  +  -  (?)  is  BV, 
right  continuous  and  grounded  such  that  all  of  the  results  in 

Theorem  6  now  hold  for  this  f  ,  since  (?)+  is  a  PM  g.d.f. 

equivalent  to  (?)  and  (?)~  is  a  PM  g.d.f.  equivalent  to  (f)~  . 

Hence 


/y  (j(-®l  ,y))  =  P  (J(-«i_,y))  =  4  (?, J(-~i  ,y) )  =  ?(y)  , 

-q  q  ~q 


all  y  €  jr 


f'  '  ~q 
q 


etc . 
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Theorem  7 

Let  f  :  Rq  -»  R  be  of  SBV. 

Then 

(i)  There  is  an  equivalent  generalized  distribution  function 
?  :  ir^  -»  R  w.r.t.  f  with 

?  -  (?)  +  -  (?>" 


a  minimal  decomposition,  (?)  +  equivalent  to  f+  and  (?)  equi- 
valent  to  f  (generalized  PM  distributions)  such  that  (f)  , 
(?)~  ,  and  ?  are  all  SBV  . 


(il) 
for  any  x 
1  s  i  s  q 


If  also,  f  is  c -grounded  for  some  constant  c  ,  i.e. 
X1 

:  e  Rq  ,  f(x)  =  c  ,  if  x^  =  -«  for  some  i  , 


then  we  can  define, 


v  v 

?  =  af-  (f )  -  pf  •  (f  )  +  c  , 


where  af  ,  /Jf  i  0  constants  and  (f)  +  and  (f)  are  legitimate 

cumulative  probability  distribution  functions  over  (c.d.f.'s), 

v  +  v  _ 

i.e.,  (f)  and  (f)  are:  PM,  grounded  and  hence  nondecreasing 
nonnegative  right  continuous,  and  SBV  with 


(f)+  (4-1  )  =  (f)  (4-1  )  =  l. 

(iii)  Conversely,  if  f  =  -  f2  ,  where  f 1  and  f2  are 

PM  generalized  distribution  functions,  then  f  is  a  g.d.f. 

(iv)  Conversely,  if  f  =  a- f1  -  bf2  +  c  ,  where  a,b  >  0 

constants  and  c  is  a  constant  and  f^  and  f2  are  c.d.f.'s,  then 

f  is  a  c-grounded  generalized  distribution  function.  Note  that  for 
any  q-interval  K  , 

Vr(f.K)  =  aVr(fx,K)  +  bVr(f2,K)  . 


Proof : 

(i)  f  SBV  implies  for  the  minimal  decomposition 

f  =  (f)+  -  (f)  ,  (f)+  and  (f)  are  PM  and  SBV  .  Theorem  6 

implies  ?+  and  ?  are  equivalent  PM  generalized  distribution 
functions  which  are  strong  BV  and  by  Lemma  3. 


) 
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0  <;  af  =  (?)+(+»l<3)  .  P  f  =  {?)"(+“•  lg)  <  +»  ,  etc. 

Theorem  8  (McShane  [174],  pp .  258,  291,301) 

Let  a  £  »(Rq)  be  a  (non-empty)  o-algebra  and  p  :  a  ir  a 
complete  regular  countably  additive  signed  measure  over  a  .  Then 

there  exists  f  :  Rq  -►  R  g.d.f.  such  that  p  extends  interval 
function  .4  {  f  ,  •  )  and  such  that 

(i)  A  <=  /(pf,Rq)  , 

(ii)  *(p,Rq)  £  *(pf,Rq)  with  p(B)  =  pf(B)  ,  for  all 
B  e  tf(p,Rq)  with  p  ( B )  finite, 

(iii)  L*(p,B)  £  L*(p^,B)  ;  all  B  e  a  and  for  all 
f  €  LX(p .Rq)  , 

|  f(x)d p(x)  =  J  f(x)d pf(x)  . 
xe  B  xe  B 


Corollary  2 


(i)  Let 


R  be  a  complete  regular  countably  additive 


finite  signed  measure  over  a  .  Then  there  is  an  f  ;  Ry  -»  R  gen¬ 
eralized  distribution  function  which  is  SBV  such  that  (i),  (ii), 
(iii)  all  hold  in  Theorem  8  (  p  extending  interval  function 
Vr  ( f  ,  •  )  )  . 

:  Rq 


:  i i )  If 


is 


SBV 


then  there  exists  a  complete 


regular  countably  additive  finite  signed  measure 
extending  interval  function  Vr(f,-)  . 


over 


Proof : 

(i)  Use  Theorem  8  to  obtain  f  inducing  p  .  Then  since 
+~  >  p(Rq)  =  pf(Rq)  =  Vr  ( f  ,Rq)  , 

f  must  be  SBV. 

(ii)  Use  Theorem  7,  yielding  7  SBV  general  distribution 
function  equivalent  to  f  and  hence 

pj*q)  =  p  (Rq)  =  Vr(?,Rq)  =  Vr(f,Rq)  <  +®  . 
f  t 
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Call,  for  any  BV  f  :  R4  -<  ir  , 
associated  Lebesgue-Stiel t jes  signed  measure. 


:  Bq  -  [0,1]  ,  the 


Theorea  9  (Hahn-Jordan  decomposition) 

Let  f  be  any  c.d.f.  over  R4  .  Then  there  exist  constants 
1  £  A1(f),A2(f),*3(f)  *  0  ,  A  j ( f )  +  A  2 ( f  )  =  1.  A  2 ( f )  =  Ag(f)  +  A4(f) 

and  hence  A  x ( f  )  +  A  2 ( f )  +  A  3 ( f )  =  l  and  c.d.f. 's  f1#  f2<  f3>  f  , 
according  as  A ^ ( f ) , a 2 { f ) , A 3 ( f ) , A 4 ( f )  are  positive  such  that  f 
is  discrete  (finite  or  countable),  f2  is  continuous,  f  is  abso¬ 
lutely  continuous  and  hence  probability  density  function  (p.d.f.) 

<3qf  3 

qT7 .  g  '  exists  in  R4  almost  everywhere  (Lebesgue)  and  f  is 

continuous  and  singular,  where  there  is  a  set  ss  *■  A  £  R4  with 

vol  (A)  =  0  and  J  df  (t)  =  ,  (A)  =  1  ,  and 

q  4  r 

t€A 

f  =  A1(f).f1  +  A  2 ( f ) • f 2 

=  AJ(f).f1  +  A  3 ( f ) • f  3  +  A4(f).f4  , 
t2  =  ( a  3 ( f ) /A  2 ( f ) ) *  f 3  +  (A4(f)/A2(f)).f4  ,  etc. 

Conversely,  any  probability  mixture  f  =  A1-^1  +  A2-f2  +  A3-f3  as 
above  determines  a  legitimate  c.d.f. 

Remarks: 

Thus,  if  f  :  R4  -♦  R  is  of  strong  BV  ,  then  f  may  not  only 

be  decomposed  into  its  minimal  forms,  written  w.l.o.g.  through 

Theorem  7  as 


f  £  (f)  - 

(f)“ 

=  af • ( f )+  - 

pf-  (f r 

+  C  , 

but  by  Theorem  9 , 
decomposed  so  that 

the 

c.d.f. ' s 

V 

(f)  may  be  further 

f  -  „  (1) 
f  *  nf  • 

f  (1) 

r  ( 3 ) 

^f(41 

'f(4)  , 

where 


(1)  v  +  v  - 
nf  =  af'A  j<<c>  )  "  )  • 
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,  v  v  _ 

f ( J}  S  «f • (f ) +  -  ;  j  =  1,3,4  , 

fjjj  is  a  discrete  generalized  distribution  function, 

is  an  absolutely  continuous  generalized  distribution  function, 
f  .  is  a  continuous  and  singular  generalized  distribution  function, 


Hence , 


"f  =  " 


(f)  (f) 


Lemma  4 


Let  a  <  b  be  constants  and  define  for  all  u,v  e  IR  , 
G(u.v)  =  max(o,min(H-^-|  ,  £-^-§  .  1  ]]  . 

Then  for  any  h,k  k  0  , 


["  1 £)>>  ■ 


-i—  „i„(h.k)  .  if  f  "  <  v  <  "  +  ? 
-a  (,v<u<v  +  k 


,  if  u  +  h  <  v  or 
v  +  k  s  u 


Lemma  5 


Let  a  <  b  be  constants  and  define  for  all  u,v  e  R  , 


,u  -  a  v  -  a, 
b  -  a  '  b  -  a1 


,  if  as  u,v  s  b 


H(u.v)  S  max[o,min[H_l^  -1  ]]-  ^  v  >  b 

max j^O  ,  min  1 , ^  j  ,  if  u  >  b 

0  ,  if  u  S  a  or  v  S  a  , 

2 

where  F  is  an  absolutely  continuous  c.d.f,  over  [0,1]  with 
p.d.f.  f  bounded  by  cq  . 

Then  for  any  h,k  2  0  , 


•["  1 1)>>  -  JJ  1  2  f(l-H  ■  S-H)  dxdv 

(  D  “  a  ; 

max( a , u )SxSmin{ b , u+h) 
max{ a,v)SySmin(b,v+k) 


Sy»bolization  and  Evaluation  of  Language 


197 


Theorea  10 

Let  f,g  :  R  -*  R  both  be  of  BV. 

2  2 
Let  H  :  R  -*  IR  be  defined  as  in  Lemma  5  and  define  H  R  -» 

R  by,  for  all  x,y  e  R  , 

H(x,y)  =  H(f (x) ,g(y) )  . 

Then  H  is  of  BV  . 

Specifically,  for  all  t  >  0 

_  c 

Vr (H , K  )  s  - 5 —  • Vr ( f , [ -t , t ] ) • Vr ( g , [ -t , t ] )  . 

(b  -a) 1 


Proof :  Use  Lemma  5  directly. 


Reaark: 

Theorem  10  may  be  extended  in  the  obvious  way  to  n  variables. 
It  also  can  be  modified  -  by  replacing  BV  by  SBV  for  f  and  g  - 

to  yield  H  being  SBV  . 


Theorea  1 1 

Let  f,g  :  R  -♦  R  both  be  of  BV. 

Also  suppose,  for  any  t  >  0  ,  f  is  continuous  over  [-t,t]  . 

except  possibly  on  a  finite  subset  of  [~t,t]  of  cardinality  mf 

r  /  l 

and  similarly  for  g  over  [-t,t]  ,  continuity  holds  except  possi¬ 

bly  over  a  finite  subset  of  cardinality  m  . 

9ft 

Suppose  finally  that  for  all  t  >  0  ,  over  [~t,t]  ,  f  and  g 

are  such  that  there  exist  finite  integers  n.  .  and  n  .  ,  such 

f  ,  t  g  ,  t 

that  for  any  u  e  rng{f)  ,  v  e  rng(g)  , 


f  a(u)  = 


f  ,  t ,  u 
U  If 
k=l 


f , t , u, k' 


g-1 (v) 


ng  ,  t ,  v 

k=1  Ig,t,v,k 


where  each  ,  ,  ,  I  .  ,  denotes  a  general  interval  in  R 

f,t,uk  g,t,v,k 

(i.e.,  either  a  single  point,  an  open  or  closed  or  half-open,  half- 
closed  interval  or  a  ray,  etc.),  and  where 


n-  .  5  n,  .  ,  n  .  ^  n  ^  . 


(This  concept  is  related  to  the  Banach  indicatrix. 


See ,  e  .  g  .  , 
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[181*3,  PP-  225-229.) 

p  p 

Let  G  :  R  -•  R  be  defined  as  in  Lemma  1.  Let  2  :  R  -»  R  be 
defined  by,  for  all  x,y  e  R  , 

3 ( x , y )  =  G(f (x) ,g(y) )  . 

Then  2  is  of  8V. 

Specifically,  there  is  a  finite  constant  i  >  0  not 

dependent  on  a  or  b  ,  such  that 

Vr(3,K  )  5  *  /(b  -  a)  ;  all  t  >  0  . 


Proof : 


Let  t  >  0  be  arbitrary  fixed. 

Let  the  discontinuities  of  f  over  [~t,t]  be  denoted 


-t  5  x(1,t)  <  x(2,t)  < 


(m,  , t, 

<  x  '  5  t 


while  the  discontinuity- 


over  [-t,t]  be  denoted 


-t  S  yU.t)  ,  y(2.t)  <  ..  <  y'”g .f"  s  , 


Let 


Q1  =  <x0*xl . xm}  • 


where 


-t  =  xn  <  x,  <•••  <x  =t 
0  1  m 


represent  any  partitioning  of  [-t,tj  as  x-axis  and 

°2  ‘  (Y0'Yl . V  • 

where 


-t 


<  v:  < 


t 


is  any  partitioning  of  [~t,tj  as  y-axis. 

Thus  a  x  a2  is  the  corresponding  rectangular  a  roduct) 

partitioning  of  K  =  [-t,t)2  . 


Refine  Q 

for  any  r,  15 

unique  integers 
using  w . 1 . o . g . 


and  Q2  to  and 

r  <  ^  and  any  s  , 

i(r)  ,  0  5  i  ( r )  5  m- 1 
the  same  notation  for 


Qj  .  respectively,  so  that 
1  5  s  5  nig  ^  ,  there  are 

.  j (s )  ,  1  <  j  (s )  <  n-1  , 

Q  .  as  for  Q’.  ,  j  =  1,2, 

J 


such  that 
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xi(r)  5  *(r>  <  xi ( r ) +1  <  xi ( r ) +2  < 


<  xi ( r+1 )  *  x 


(r+l ) 


suitably  modified  if  needed  for  r  =  t#  and  similarly 
,(s)  „  „  „  . .  „  „  <  y(S+l'  < 


v  <  V  ' 

■j(s)  Y 


<  yj(s)+l  <  ’  <  Yj(s+1) 


Yj(s+l)  +  1 


"n  turn,  define  for  all  1  s  r  s  m,  v  ,  1  <  s  <  in  .  . 

r  #  V-  y  r  t 


Cr  ■  ,xi ( r ) +1 • 

Dr  i  blsl-l1  ' 


and  par :itionings  of  C  and 


c  =  (x.,  .  ,,  x.  ,  .,  x.,  ,  ,}  x  r-t, t]  , 
r  a ( r ) -1  a ( r )  i(r)+l 


3r  =  c-t.t]  x  {yj(s)_r  yj(s) -  yj(s)+1 

Also,  define  w.l.o.g. 


>  • 


£r  s  ^Xi(r)+1’  xi(r+l)-l^  x  ^Yj(s)+1'  Yj(s+1)-1^  ' 

rating  that  f  is  continuous,  and  hence  uniformly  continuous  over 
the  first  factor  and  similarly,  g  is  uniformly  continuous  over  the 
second  factor. 

Let 

*r,s  =  { Xi ( r ) +1 ' xi ( r ) +2 - ,Xi(r+l)-l} 

X  {Yj(s)+l'Yj(s)+2' - Yj(s+1)-1} 

be  the  corresponding  rectangular  partitioning  of  Cy  . 

It  follows  from  the  additivity  property  of  V  ( 5 , [ -t , t ] ,? ) 
w.r.t.  Q-intervals  in  ?  for  any  partitioning  f  of  [~t,tj  that 


VG'Kt'W 

mf ,  t 


m 


g- 1 


i  Z  V  ( G , C  ,C  )  +  Z  V  (G.D  ,CD  )  +  Z  V  ( G , E  ,C  )  . 

r=2  s  =  l  isrsmf 

l<ssm 

g  - 1 

Consider  now  Lemma  5.  Define  the  range  values  (not  necessarily 
distinct ) 


f(x1)  ,  I 


0 , 1  ,  .  .  .  ,  m  , 
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vj  =  g(Yj )  .  j  =  0,1 . n 


Define  the  relations 


A  =  { (U. , V . ) 

1  J  ' 


U  .  £  V  .  <  U  . 

1  J  1  +  1 


or  u.  <  v .  s  u. 

i  J  i  +  1 


or  ui+l  <  vj  5  ui  or  u1+1  *  v  <  u.) 


and  the  dual 


®  -  { ( u  ^  ,  v 


v  <  u .  <c  v  or  v.  <  u.  S  v. 

J  i  1+1  21  2+1 


°r  Vj+1  <  Ui  5  Vj  °r  Vj+1  *  ui  <  vj>  • 


For  any  ,  y  , 


A  (G  ,  J  ( 


r  x .  i 

X 

Xi  +  1 

ly2. 

t 

[yj+lJ 

)  ) 


=  o  , 

if 

ui+l  ui 

or 

Vj+1  “ 

or  u . , u . 

s 

v .  ,  v 

i  i+1 

i'  2  +  1 

or  v.,vj  +  1 

s 

ui'ui+i 

14  o  , 

if 

(Ui,Vj)  6 

u 

* 

t 

For  any  1  S  r  <  »f ^ t  ,  1  s  s  S  mg ^ t  .  define 

*r  =  {u.  |  i ( r )  +  1  s  i  s  i ( r+1 )  -  1}  , 

gs  "  (vj  1  j(s)  +  1  s  2'  s  j(s+l)  -  1}  > 

Kr,s  =  ^ui  I  ui  e  ?r  &  3  e  5S  with  (u^v^)  e  a } 

For  any  «  gg  ,  define 


»r>s(v.) 

Also , 

=  (U.  , 

ui  €  ?r  with 

e 

■*}  • 

vr(2.cr,cr)  = 

m-1 

Z  |  A 

s=0 

■  (;*<*)■ 
ys+l  - 

I)h 

+ 

m-1 

Z  |  A 

s=0 

(S,J[[y1(r>]  '  | 
s 

yi(r)+ll] 
•ys+l  3 3 

!  1 

5 

2n„  i- 

g.  t 

b  -  a 

m-1 

./,0 

"Ui(r)-l 1 

'|vj+l  ~  vjl> 

Symbolization  and  Evaluation  of  Language 


201 


m-1 


+  jlo  min* |ui(r}+l_ui(r) ^ ' I vJ+l“vj 


I) 


b  -  a 
Simi larly , 


4n  n-1 

s  —  '  3l0  'VrV 


4nf  m-l 

VG'W  s  lu1+1-ui  I 

Now,  fix  any  1  S  r  s  m.  .  ,  l  s  s  s  m 

g  <  t 


,  and  consider 


V  ( G ,  E  ,e  )  : 
r  r ,  s  r,s 


Given  Oj  refined  to  ,  a‘2  refined  to  Qj  ,  fix 

refine  a*  further  to  O'  •  over  [x  x  l  c„rh 

1  1  1  i(r)+l,xi(r+l)-lJ  such 

again  w.l.o.g.  using  the  same  notation. 


max  |ui+1  -  u. 


( i ( r ) +1  sis  i ( r+1 ) -1 ) 
This  implies 


*r  =  u  }r,s(vj) 

v  .  e  <;  J 

J  s 

Let 


:  min  |vj  +  J  -  Vj 

rj(s)+isjsj(S+i)-i] 

lover  distinct  v^j 


r , s  ~  {lxi'Yj)  I  i(r)+l  Sis  i(r+l)  -1,  j (sj+isjs j (s+1 ) -1 


(ui'vj)  €  • 


Hence,  using  Lemma  2, 

fX«l 


Z  I  ^  (  G  ,  J  ( 

(x  ,  y  )  e  x 
a  1  p  ‘  r ,  s 


rk. 

a  +  1 
Vp+1 


)  ) 


-  2  2 

lv3'1,vi 

,  2  2  2 


|^(G, J( 

X 

a 

v 

> 

\*r 

) ) 

LV/3  J 

yp+ 1  j 

Ui€,r,s(vj> 


r  .-1  (|ui +i-ui I /(b-a) ) 

fxa€f  "(u^  1  +  1  1 

y/}€g-1(Vj) 


and 

that 
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5 


v.eg  u.<=>  (v.) 

3  s  i  rs'  2 


(4nf,fng,f lui+i-i1l/(b-a)) 


i(r+l)-i 

4n,  •  n  .t- 

<  f : t  gvt  <  2 


b  -  a 


lu 


i  =  i ( r ) +1 


1  +  1 


-  V 


Next,  define 


1 r ,s  *(xi,y3*  I  i<r)+1  ^  1  <  i ( r+1 ) -l ,  j(S)  +  1  s  j  s  j(s+l)-l, 

(ui,v;.)  e  a  +  ji} 


and 


, s  {vj  1  vj  €  5s  &  3  ui  6  5r  with  e  »  h  ji)  . 

Now,  defining  w.l.o.g. 


ar,s^V  g 


( 1 ( r )  +  l< ii i ( r+1 ) -l ) 


<  ^r,s*Ql*  "  max(ui) 


(i(r)+l<i<i(r+i)-i) 


it  follows  that  if  v.  €  a  then 

J  r » s 


v  .  ^  a  <  v  .  , 
j  rs  j+l 

v  .  <  p  s  v  .  , 
3  r , s  j+l 


7j+l 


r,s 


°r  Vj+1  <  Vs  5  vj  ‘ 


Consider,  e.g. ,  Vj  S  ar<s  <  vj  +  1  .  By  continuity  of 

3  ■  5  y°  ‘  9"1(ar.s>  wlth  v,  s  y„  <  yj+a  »ith  yq  «  iSitia 

for  some  k  .  1  i  k  i  n  s  n  .  Thus  by  Le„„a  5.’  we'.ust 

.  r ,  s  * ' 

have 


Yi  5  Vt.a 


r,s,k 


Vi  • 


Hence,  for  a  given  ,  there  can  be  at  most 


2‘ng,t  v3  e  Vs  ■  Thus' 


r*„i 


(*a'V  «  r,s 


|G' J{  [Y/3  J 


7a  +  1 
*0  +  1 


)  ) 
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V 

j  r,s 


4n,  *n  . 
f  ,  t  g,  t 


2  i ( r+1 )  -1 

8nf , t' ng, t  5  | 

b  “  a  i=i ( r ) +1 


i ( r+1 } -1 


i=i ( r ) +1 


ui+l  -  ui 


( lui+1-ui I /<b  "  a) ) 


Hence , 


[x  * 

Y^J  •  [Vi 

(x  ,y  )  e  r  of 

VA  1  /  O  i  |  O 

4nf  t.n  • (l+?r  )  i(r+l)-l 

5  -  A  — a - gj-L-  •  X  lui  +  i“uil  • 

b  a  i=i ( r ) +1  11  1 

Thus,  combining  the  above  results. 


*  <4"f  ji0|v3  +  l_vjl  *  4”g.fnf,t  1f0l"d*l-ui 

m-1 

+  4nf,t‘ng,t{1  +  2ng,t>-  ^0'ui  +  l-ui') 


S  r- -  •  1  .  . 

b  -  a  t 


where 


■».  =  4n,  _(in  .  +  n„  •(!  +  2n„  «. )  )  •  .d,  .  +  4m,  -n 


t  f,t'  g,t  g,t 


1/t  r,t  g ,  t  2 ,  t 


where 


by  BV,  B  d  ,d_  ,  >  0  constants  wil 

1  »  'C  t 


^0iui+rui'  <  di,t  and  <  d2 , t 


Remarks 


1.  If  f  =  const.  and  g  is  arbitrary,  then  Vr(8,K  )  =  0  for 
all  t  >  0  .  In  particular  g  may  not  be  of  BV  although  clearly 
here  G  is  BV  for  Theorem  11. 

2.  However,  if  f  even  has  one  discontinuity  at,  say,  x 


wi  th 
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jump  f(xQ+)  >  f(xQ)  .  say,  and  3  is  such  that  there  exist 

constants  a  <  b  such  that  for  all  a  i  x  S  b  , 

f(xQ+)  2:  g ( y )  *  f(xQ) 

and  g  is  not  of  BV  ,  over  [a,b]  ,  then  3  is  not  of  BV  . 


3.  Theorem  11  may  be  extended  from  two  one-dimensional  arguments 
for  G  to  any  number,  where 


G(x1#...,x  )  =  max(0,min 


f,  fl<xi>-a  fn(xn>-al 
l  '  b  -  a  '  •  •  ‘  '  b-a. 


for  all  x,,...,x  e  R  .  Thus  here 
l  n 

Vr ( 3 , [-t , t ] n)  <  1 1  /(b-a)  ,  for  y  <  +«  . 

n  /  x  n  1  x 

4.  Theorem  11  and  its  extension  mentioned  above  may  be  modified  to 

yield  3  being  of  strong  BV  ,  when  in  the  assumptions  of  the 
theorem,  all  dependency  on  t  is  omitted. 


Theorem  12  (Sufficiency  for  BV  of  cartesian  product). 


Let  <t». 


be  any  n-copula  with  probability  mixture 


representation  for  c.d.f.’s  Hj  '  Gj  ' 


VU1 . V  -  z  <bj"aj),Hj(ul . un)+  z  (aj-bj.jJ-GjtUj . un) 

j  1  31 

for  all  ulf . . . ,u  «  [0,1]  ,  where  constants 

0  =  bQ  £  a,  5  bx  <  a2  5  b2  s  • • •  s  1  , 


z  (b  -a  )  +  z  (a ,-b .  )  =  1  , 

j=l  3  3  j=l  3  3~1 


Hj(ul . V  =  % 


(j)lUl  ~  a3 


5j  •  ai 


“n _ 

FT  -  a  . 
j  3 


where  $ 


&  J '  is  an  Archimedean  n-copula  with  p.d.f.  g^.  bounded 


over  [0,1]  by  <  +®  , 


,  ui  -  b  .  , 

G,(u,,...,u  )  =  max( 0 , min  1,  - - r-- 

j  l  n  aj  "  bj-l 


Un  -  bj-l 
'  'a .  -  b  . 

J  J-l 
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for  all  u1,...,uri  e  [0,1]  ;  j  =  1,2,... 

Suppose  satisfy  Theorem  11.  Define 

Cjt  =  Cj-Vr(f1,[-t,t])*--Vr(fn,[-t,t]) 


and  f ,  x  •  •  •  x  f  :  Rn  -►  [0,1]  by , 


(flx‘--xfn){xl . V  -  VW . fn(xn>> 


for  all  x,  ,  .  .  .  ,x  €  ir  .  Then  Theorems  10  and  11  yield: 
In 


(i)  Vr( f 1x* •  ■  *fn<  [-t,t]“) 


s  z 


ci,t 


+  Z 


j  =  l  (b  -  awn  1  j  =  l  j,t 
J  j 


(ii)  Thus,  if  also,  e.g., 

0  =  bn  <  a.  <  b.  <  a_  <  b0  <• • • <  a^  <  b  =  1  ,  then  f.x---xf  is 

01122  nn  In 

BV  . 


(iii)  With  obvious  modifications,  SBV  holds  for  f.x- • -xf 

l  n 


We  will  say  that  a  copula  <$>&  has  a  bounded-finite  probability 

mixture  representation  iff  the  conditions  of  Theorem  12  iii)  hold 
for  <*>&  . 

Theorea  13  (Basic  approximation  II) 

Let  be  a  copula  with  a  bounded-finite  probability  mixture 

representation . 

Let  f ,  g  :  ir  -*  ir  be  arbitrary,  bounded  and  Borel  measurable. 
Then  there  exist  sequences  (f_)„  ,  „  ,  (g_>  ~  of  BV 

functions 

f  ,g  :  R  -*  ir  for  which  for  all  t  >  0 
m  3m 

0  =  lim  J*  |  f ( x )  -  f m ( x )  |  dx  =  lim  j*  |  g ( y )  -  9m(y)  I  dy 
m-»®  x=-t  m-*®  y=-t 

and  which  are  BV  and  satisfy  the  conditions  of  Theorem  12  so  that 

f  x  is  BV  for  m  =  1,2,..,  ,  and  for  all  t  >  0  ,  (f  x  <,  )  , 

m  m  m  Jm 

1  2 
m  =  1,2,...,  converges  in  L  -norm  to  f  x  g  over  [-t,t] 
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m 


0  =  lim  /  J  | 
n-*»  x ,  y 

In  particular  f 

ra 

1,2, .. .  . 


(fxg)(x,y)  -  (fxg_)(x,y)  |  dxdy  . 

e  [-t,t] 

and  g  may  be  chosen  as  step  functions; 
m 


Proof : 


First 

choose 

f 

m 

and 

g  as 

sm 

step 

functions  as  in 

Theorem  3 

(i)  ,  m  = 

1,2 _ 

Clearly 

each 

f 

m 

and  g  satisfy 

m 

the 

hypo- 

theses  of 

Theorem 

11 , 

and  by 

Theorem  12 

(ii)  ,  f  x  g 

m  m 

is 

BV  ; 

m  =  1,2,...  , 

It  also  follows  readily  for  all  x,y  e  [-t,t]  ,  by  use  of  the 

triangle  inequality  and  Moynihan's  Theorem  (see  section  2.3.6). 

|(fxg)(x,y)  -  (fmxgm) (x,y) | 

*  l<t>&(f  (x)  ,g(y))  -  <|>&(fm(x)  ,f  (y) )  | 

+  I4>&(fm(x)  ,g(y)  )  -  i}>s(fin(x)  ,gm(y) )  | 

5  lf(x)  "  +  l9(Y)  "  9m(Y) I  . 

whence 


/  |(fxg)(x,y)  -  (f  xg  )(x,v)|  dxdy 

x, y  €  [-t,t]  m  m 


<  2t 


(  r  i 

x=-t 


f(x)  - 


f  _  ( x ) 
m 


dx  +  J  |  g  ( y ) 
y=-t 


gm(Y> 


dy 


Theorem  14  ( Hel ly-Bray-1 ike  theorem) 

Let  g  :  r  be  continuous  over  closed  q-interval  K  . 


(i)  Let  fn  :  ir  be  uniformly  bounded  over  K  and  uniformly 


and 


BV 

over 

K  :  Vr(f  K)  S 

7 

O 

<  +®  ,  for  all 

dent 

of 

g  ,  3  subsequence 

(fnj) j=l,2, . . .  c 

there  exists  f  :  -♦  JR 

o 

BV 

such  that 

(a) 

lim  f  (x)  =  f  (x) 
nj  ° 

for  all  x  e  K 

(b) 

Vr(fo,K)  s  to 

(c) 

lim  J  g(x)df 

X«K  j 

(x) 

=  /  g(x)df  (x) 

X  €  K 
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(ii)  If  (i)(a)  holds  for  f 
also  hold. 


3  =  1.2, 


then  ( i ) (b) , ( c) 


Proof :  See,  e.g.,  [181'],  pp.  233-234. 

Theorem  IS  (Basic  convergence  theorem) 

Let  K  be  a  closed  q-interval.  Let  f  :  ir^  R  be  BV  .  Let 

g  :  iRq  ir  and  g  :R^-»iR,n  =  l,2,...  all  be  continuous  with 

3n  3o 

lim  gn(x)  3  90(x)  >  uniformly  in  x  e  K  . 
n-»o> 

Then 

lim  /  g  (x)df(x)  =  /  g  (x)df(x)  . 

n~»<®  xe  K  xe  K 


Proof :  Standard;  see,  e.g.,  [181 '],  p.  232. 


Remarks : 

1.  Theorems  14  and  15  may  be  extended  to  where  gn  ,  n  =  1,2,... 

and  g  are  finitely  piecewise  continuous  but  such  that  over  all 
regions  of  their  discontinuities,  all  fn  are  simultaneously 

continuous . 

2 .  For  the  space 

7(l)  =1  { f  |  f  ;  ir9  ir  is  measurable,  bounded  and  of  BV)  ,  for  each 

t  >  0  ,  the  L1-norm  over  [-t,t]^  may  be  used,  but  clearly  Theorem 
3  (i)  shows  that  it  is  not  complete  since  any  bounded  measurable 

non-BV  function  may  be  L1([-t,t]<^)  -  approximated  by  a  sequence  of 

f  «  f (q)  ,  n  =  1,2 . 

n 

The  supremum  norm  furnishes  an  alternative  norm  for  (over 

all  Rq)  ,  but  approximations  analogous  to  Theorem  3  must  be  modi¬ 
fied  (Lusin's  and  Egorov's  Theorems)  up  to  sets  of  arbitrary  small 

positive  measure.  Theorems  14  and  15  imply  that  in  general  conver¬ 
gence  under  the  sup  norm  can  only  occur  for  uniformly  bounded  and 
uniformly  BV  functions,  in  which  case  a  form  of  completeness  then 
holds . 

3.  For  the  subspace  (of  5^^}  of  all  c.d.f.'s  over  iRg  ,  the 

L6vy  metric  over  furnishes  a  genuine  complete  metric  (and  norm) 

where  convergence  in  the  L&vy  metric  is  equivalent  to  weak  conver¬ 
gence  (i.e.,  pointwlse  over  all  points  of  continuity  of  the  limit 
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function).  (See,  e.g.,  Schweizer  and  Sklar  [26],  Chapter  4.) 

Consider  now  the  following  discrete  case  which  will  be  useful 
in  the  next  theorem: 

Lee  Uj  £  s  be  discrete,  j  =  l,...,q  ,  such  that  for 

•  =  (x.  |  l  =  .  .  .-2, -1,0, 1,2,  .  .  .}  with 

J  J 

<xj,-2  <  xj.-i  <  xj.o  < 


-  xj,<>  '  0 


all  j,« 


Let  f  :  x  D  .  R  be  arbitrary.  Let 


d  q 

Q=(J(x)|x€  x  D.) 

j  =  l  3 


be  a  semi-closed-open  partitioning  of  Rq  such  that  for  all 


€  x  D  , 

3  =  1  ^ 


X1  l 

1 '  1 

q  n  +  d 

II 

X  0 

€  X  D.  ,  X  = 

3=1  3 

X 

q  ,1 
^  a 

a  ,  l  J- 1 
^  a 

J(x)  =  '  Xj,<+1>'  J{x)  =  '  xj,t+l]  =  J(x'x+>  • 

ir' 

In  turn,  define  fQ  ,  the  step  function  imbedding  of  f  into  IR 

q 

where  for  any  y  «  Rq  ,  let  J(x)  «  Q  be  unique  such  that 
y  «  J(x)  . 


fQ(y)  -  f(x)  . 

It  follows  that  for  any  g  :  Rq  -•  IR  such  that  f  is  contin- 
d  q 

uous  at  all  x  <s  D  =  x  D.  , 

3  =  1  J 


,  J  dP{y)dfQ (y>  =  1  g(^)'^  (ffl,j(x)) 

(yeiR4)  u  xeD  q 


=  Z  g ( x )  A  (f , J(x) )  , 

xeD  q 
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fQ  being  BV  .  Also,  is  SBV  with 

Vr(f  )  =  X  \A  (f,J(x))|  iff  that  series  converges, 
xe  D  q 


Theorem  15 '  (Fundamental  approximation  of  Lebesgue-Stielt jes  measure 
and  integration) 

(i)  Let  g  :  r  be  continuous. 

Let  f  :  Rq  -♦  R  be  bounded,  measurable  and  BV  .  Thus, 
w.l.o.g.  f  =  f1  -  f2  with  f  ,  f2  both  g.d.f.'s  and  hence  both 

are  continuous  over  Rq  except  on  at  most  a  countable  subset,  say, 

G  of  Rq  . 
o 

For  each  integer  nil,  define 


Dn  ^  <-n  +  k/2n  |  k  =  0,1,2 . n-2n+1)  c  [-n,n] 

with  mesh  l/2n  .  Thus,  for  any  nil,  D  refines  D 

n+1  n 

Also,  define  B  =  (D  ) q  =  D  x---xD  £  [-n,n]q  and  note  that  B 

n  n  n  n  n+1 

refines  Bn  ,  for  all  nil.  For  each  nil,  let  Qn  be  the 

associated  semi-closed-open  partitioning  of  [-n,n]q  .  Hence  an+1 

refines  Q  over  both  [-(n+1) ,n+l]^  and  hence  [-n,n]g  . 
n 

Then  define  for  each  nil. 


f  a 

n 


<f‘Va 


over  [-n,n] 


over  [-n,n] 


It  then  follows  that  for  any  t  >  0  ,  (see  also  Theorem  2) 


(a) 


( f  n )  n_i  2  -•  f  pointwise  over  [-t,t] 


except  possibly  over  Gq  , 


(b) 


(fn)n=l,2,...  f  over  • 


(c)  Vr(fn, [ -t , t ] ^ ) 


r  Mq((f  I  Bn),J(x)) 


x  e  B 


n 


<  Vr(f,[-t,t]M) 

<  +■»  , 
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for  all  nit. 

Hence  by  Theorem  14,  for  any  t  >  0  , 

(d)  /  g(x)df (x)  =  11m  J  g(x)dfn(x)  . 

xe[-t,t]q  n-»®  xe[-t,t]q 

=  11m  X  g(x) -A  (f , J(x) )  . 
n-*°°  x€  D  q 


(ii)  Result  (i)  can  be  extended  w.l.o.g.  by  slight  changes  in  the 

constrution  of  the  f  's  so  that  g  may  be  considered  continuous 

n 

except  on  a  finite  number  of  points  or  piecewise  continuous,  etc. 


Theorem  16  (BV  measure  reduced  to  copula  measure) 


Let  f  : 

Rg  R 

be  a  c  .  d .  f  .  with 

one  dimensional 

marginal 

f . 's  f  , 

6)  yields 

R  -►  R  which  by 

Sklar1 s  Theorem 

( section 

f  (xx 

. V 

=  Vfl(Xl> . fq(Xq)}  ' 

all  x.^ ,  .  . 

.  ,x  e  R 

q 

and  some  n-copula 

• 

Let  g  : 

Rq  R 

be  measurable. 

Recall  the  quasi-inverse  f^  (see  also  Appendix,  2.3.6) 


defined  by 


f*(x)  3  inf  fj^x.l)  ;  x  €  [0,1]  , 


for  j  =  1 , 2  ,  .  . .  .  Then 


I  g(Xj,  .  ,  .  ,xn)df  (xa . xn) 

X. . X„  €  R 

l  n 


J  g(x1(  ...  ,xn)d4>&(f1(x1)  ....  ,fn(xn)  ) 


xx . xn  €  * 


‘  f!  (U1  > . fn(V)dVul . V 

U i , . . . , Un  €  [0,1] 


Proof :  See  Schweizer  and  Sklar  [266],  section  6.5. 


Marginal  functions 

Recalling  the  previous  definition  of  C-marginal 
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fc  :  Rcard(c)_  R  for  any  given  f  :  Rq  -»  R  and  Z  *  C  s  {l,...,q}  , 

suppose  first  that  f  is  PM  .  This  condition  is  not  sufficient 
for  all  marginals  to  be  also  PM  .  For  example, 
h ( x , y )  =  f(x)  +  g(y)  is  PM  since  42(h,k)  s  0  ,  for  all 

2-intervals  K  ,  for  any  choice  of  f,g  :  R  -»  R  . 

However,  the  following  holds 

Theorem  17 

Let  f  :  R*^  -»  R  be  given. 

(i)  If  f  is  a  PM  g.d.f.  which  is  bounded  -  equivalently  f  is 
also  SBV  and  thus  f(-H»lg)  <  +*  is  the  supremum  of  f  ,  then  for 

all  Z  *  C  s  {l, - q)  ,  c-marginal  fc  :  Rcard(c>  _  R+  exists  and 

is  a  g.d.f. 

(ii)  If  f  is  g.d.f.  with  minimal  decomposition 

f  =  (f)+  -  (f)“ 

such  that  (f)+  ,  (f)  and  hence  f  are  all  SBV  and  hence  all 
bounded,  then  for  all  z  *  C  £  {l,...,q}  ,  C-marginal  fc  exists, 

although  in  general 

fc  11  (fl£  -  <flc  ' 

and  fc  is  a  g.d.f.  which  is  also  SBV  .  The  result  may  be  modi¬ 
fied,  if  f  being  SBV  is  replaced  by  BV  and  bounded. 


Proof:  Use  hereditary  properties  of  groundedness,  right  continuity 

and ,  by  Lemma  1 ,  PM  . 


Remark: 


Theorems  7  and  17  (ii)  imply  that  if  f  :  Rq  R  is  SBV  ,  for 
all  Z  *  C  £  (l,...,q)  ,  ?  is  a  generalized  distribution  function 

which  is  SBV,  for  f  equivalent  to  f  . 

It  is  of  some  interest  to  relate  measures  ^  induced  by  the 

marginals  of  f  and  marginal  measures  (to  be  defined)  of  fj 

? 
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2.4  Category  and  topos  theory  viewpoint  of  generalized  set  theory. 
2.4.1  Introduction . 


A  logical  basis  for  fuzzy  set  theory  may  be  established  using 
[0,1]  -  valued  logic  and  its  semantic  evaluations  as  presented  in 

previous  sections  2.3.2  and  2.3.3. 

An  alternative  approach,  entails  the  imbedding  of  Zadeh ' s  min- 
max  fuzzy  set  theory  (5  )  Into  a  special  category  theory  structure 

called  a  topos,  so  that  formal  language  and  set  theory  investiga¬ 
tions  may  be  carried  out.  (See  Johnstone  [127],  MacLane  [166],  and 
Lawvere  [152].)  In  the  ullowing  discussion  [0,1]  is  replace  d  by 
the  more  general  structure  of  a  complete  Heyting  algebra  H  .  (See, 
e.g,,  Ponasse  [204]).  Nothing  is  gained  by  restricting  H  =  [0,1] 
and  analysis  is  eased  by  use  of  che  more  general  structure.  Goguen 
[96]  using  six  axioms  abstracting  properties  from  Fuz(H)  -  Zadeh 1 s 
fuzzy  set  theory  with  [0,1]  replaced  by  H  ,  the  objects  of  cate¬ 
gory  Fuz(H)  being  fuzzy  set  membership  functions  and  the  arrows 
fuzzified  functional  transforms  between  objects  -  showed  that  any 
category  satisfying  them  necessarily  must  be  the  category  Gog(H)  , 
where  the  object  collection  0b(Gog(H))  consists  of  all  possible 
pairs  (X,$A)  ,  where  A  is  any  fuzzy  subset  of  X  and  the 

morphism  or  arrow  collection  Ar(Gog(H))  consists  of 


Ar ( Gog ( H ) )  =  U  Ar . 

over  all 
(X.<t>A).(Y.4>B) 
.€  0b  ( Gog  { H  )  / 


X.1>A)  -  (*- V  ) 


(Gog 


(H)  )  J 


Ar 


(  (x.<t>A) .  (  Y ,  <J>B  )  ) 

^  (f  |  f  :  X 


(Gog) (H) ) 
-  Y  and 


fB(  f  (x)  )  2.  $A(x)  . 


all  x  e  X)  . 


The  motivation  for  the  last  definition  cones  from  the  extension 
principle  applied  to  Zadeh ' s  fuzzy  set  system  (1  -  (•),  min,  max)  , 
i . e . ,  for  any 

f  :  X  -  Y 


and  fuzzy  subset  A  of  X  ,  for  all  x  «  X  ,  it  follows  tha^  for 
all  x  €  X 


*f(A)(f<X))  2  VX)  • 

See  subsection  2.4.2  (D)  for  further  details  for  Gog(H)  . 

Chapin  [32],  [33],  Weidrer  [270]  and  Novak  [195]  considered  the 

membership  operator  e  and  fuzzy  set-like  extensions  connected  with 
it.  All  of  these  and  Albert  [2],  as  well,  have  approached  fuzzy  set 
theory  through  an  axiom  system  with  deductive  rules.  For  example, 
Chapin  uses  14  axioms,  but  one  difficulty  with  his  system  -  unlike 
Goguen ' s  [96]  ,  mentioned  above  -  is  the  possible  lack  of  uniqueness 
of  the  resulting  model  of  fuzzy  set  theory.  (See  again,  23. 4.) 

Ey tan  [56]  produced  a  variation  Fuz(H)  of  Gog(H)  .  (Subsection 
2.4.2  discusses  Eytan's  formulated  of  Fuz(H).)  Indeed,  Garrega 

[29]  demonstrated  easily  that  Fuz(H)  and  Gog*(H)  -  Gog  t  H )  , laving 

all  of  its  objects  (fuzzy  sets)  restricted  to  their  supports  -  when 
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H  is  a  complete  chain  (the  case  of  course  for  [0,1] 
isomorphic . 

Fuz(H)  is  a  category  where  for  any  (X,< p^)  «  0b(Fuz(H) 

identity  arrow  (under  °  )  in  Ar,,v  .  .  (v  A  . .(Fuz(H))  is  id 

^  ^  A  '  A ' ™A 


)  ,  are 
i  )  ,  the 

(X'V  ’ 


where 


ldix,v  s  V> 


(•  •  )  a  a, 


X  X  X  -  [0,1] 


(For  Gog(H)  ,  composition  of  arrows  is  the  same  as  ordinary  com¬ 
position  of  functions  and  the  identity  ar.ows  are  the  same  as  the 
ordinary  identity  functions  over  sets.) 

Eytan  [56]  also  claimed  that  Fuz(H)  was  a  topos .  However, 
Car:  ega  [29],  Ponasse  [  204]  and  Pitts  [203]  all  pointed  out  the 
incorrectness  of  Eytan's  results.  (H  necessarily  must  be  a  Boolean 
algelra  which  is  not  the  case  for  [0,1].)  Indeed,  Carrega  specifi¬ 
cally  showed  that  Fuz(H)  was  a  peudotopos,  i.e.,  possessed  all  of 
the  required  basic  properties  to  be  a  topos,  except  -hat  it  has  no 
subobje't  classifier  ([127],  pp .  23,24).  Also,  Pitts  [203]  claims 
that  Higa(H)  is  a  better  candidate  than  Fuz(H)  for  developing  a 
"fuzzy"  set  theory,  since  Higg(H)  is  not  only  a  category,  but  a 
tcpos  and  the  quotient  object  completion  of  Fuz(H)  ,  Higg(H)  (which 
as  shown  below  contains  FuziH)  as  a  diagonal  imbedding)  treats  si¬ 
multaneous  memoershlp  -  as  opposed  to  Fuz(H)  treating  single 
memberships,  although  Fuz(H)  is  extendable  to  non-interactive 
multiple  membership  functions  by  repeating  the  (here, min) 

operation : 


W 


■V  -  VW . W> 


all 


x 


1"  ‘ 


,X  € 

n 


More  tie-ins  with  multiple  point  coverage  functions  of  random  sets 
will  be  treated  later.  (See  subsection  2.4.2  for  further  details 
concerning  Higg(H).) 

Since  min  is  the  maximal  t-norm,  a  more  general  class  of 
Higg's  objects  is  obtained  by  replacing  min  by  in  the  above 

definitions.  In  a  sense,  ea  h  Higg  object  f  is  a  kind  of  equality 
over  X  x  X  . 

If  (X,<J>A)  e  0b(Fuz(H))  is  arbitrary,  then  clearly 


(X,  id(x<(?)  })  €  0b(Higg  (H)  )  . 

Conversely,  if  (X,f)  €  Ob(Higg(H))  , 

( X , D ( f ) )  «  0b ( Fuz ( H ) )  , 

where  D(f)  is  the  diagonal  part  of  f 

D(f ) (x)  =  f (x,x)  , 

•for  all  x  «  X  . 

See  Fourmann  and  Scott  [59]  for  bas.c  properties  of  Higg(H)  , 
which  is  shown  to  be  the  Grothendieck  topos  of  all  sheaves  over  H  . 
Furthermore  (see  [59]  and  [56])  Higg(H)  can  be  used  to  realize 
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general  formal  higher  order  many-sorted  languages -through  a  form  of 
intuitionistic  logic  yielding  sound  and  complete  semantics.  (See 
also  subsection  2.3.3.) 

In  a  second  (unpublished)  paper,  Eytan  [57],  whose  earlier  work 
was  to  a  large  degree  based  on  Coste's  construction  [41]  being  a 
topos,  extended  Fuz(H)  by  weakening  the  single-valued  condition 
for  the  arrows  of  Fuz(H)  . 


2.4.2  Basic  concepts 

A.  Basic  fuzzy  set  category  Fuz(H)  ( Evtan  [56].) 


SET  =  (Ob(SET),  Ar(SET))  is  the  category  of  all  well-formed 
ordinary  sets  with  Ob(SET)  ,  the  object  part,  being  the  collection 
of  all  well-formed  sets  and  Ar(SET)  the  collection  of  all  arrows 
for  SET  -  in  this  case 

Ar ( SET )  =  U  Ar^  y  (SET)  , 

X,Y  e  Ob ( SET ) 

Arxy(SET)  =  YX  =  (f  |  f  :  X  -  V)  , 
for  all  X,  Y  e  Ob(SET)  . 

We  use  similar  notation  for  any  category  c  =  (Ob(c ) , Ar (c ) )  , 

Ar(c)  =  u  Arx ,y(C)  • 

X , Y  «  Ob(C ) 

For  any  fixed  complete  Heyting  algebra  H  and  X  e  ob(SET)  , 
define 

Obx(Fuz(H)  £  {  (X,<7 )  |  <7  :  X  -  H) 

=  UX.17)  |  n  «  HX) 


the  class  of  all  H-valued  membership  functions  on  X  ,  and  in  turn 


Ob(Fuz(H) )  =  U  Obx(Fuz(H)) 

X  e  Ob ( SET ) 

is  the  class  of  all  H-valued  membership  functions. 

Note  that  for  any  X  e  Ob(SET): 

(1)  Obx(Fuz(H))  2  <}>x  3  {(X,<7)  |  n  «  (0,1)X)  ,  the  class  of  all 

ordinary  set  membership  functions  on  X  ,  i.e.,  membership  functions 
of  X  . 


(2)  The  operations  a  ,  v,  C  over  H  may  all  be  used  to  define 

X 

operations  on  Ob  (Fuz(H))  ,  where,  for  any  rj  q  ....  e  H  , 

A  12 
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(X,71)  A  (X,/72)  =  (X.t'Jj  a/72))  , 

(X,/71)  v  (X,/72)  *  (X,{/7J  v  /;2)>  , 

(X,/71)  =»  (X.i72)  =  (x,^  a  n2)) 

C  (X,<7j)  =  {X,  {  C/72  )  )  . 
where  for  all  x  e  X  , 

(*?!  A  /7  2 )  ( x)  =  ^j(x)  A  /72(x) 

(n  1  v  *7 2 )  (x)  ^  /?1(x)  V  /72(x) 

('71  =»  72)(x)  -  ^(x)  »  72(x) 

(C^)  (x)  =  C(/71(x)  )  ,  etc. 

Furthermore,  because  of  commutativity  and  associativity  of  a 
and  v  ,  we  can  define  similarly  unambiguously  recursively 

(X,/71)a(X,/72)  a  •••  3  (X,/7iA^2A- •  •  ) 

(X,/7  j  )v(X,/72 )  v  •••  =  (X,/71v/72v- •  )  ,  etc. 


(3)  Also,  we  can  define  the  cartesian  products  and  cartesian  sums 

Xj 

for  any  X^  e  Ob(SET)  ,  ^  «  H  J  ,  J  *  1,2 . 


(X1''7,xa(X2^2}xa' 


(XaxX2x- 


q  ,x  n  • 

'1  A  '  2  A 


) 


(xr7i>  \^2>n2)  \- 


=  (XjXXjX' 


,/7  ,  t  n  _t  • 
’  '  1  v  '  2  v 


where,  for  any  xj  €  Xj  *  J  =  1,2,... 

('71Xa'72Xa'  ‘  '  )<xi'x2'  ‘  '  ’  >  “  (x:  )  a^2  (x2  )  a- 


''V  v'V  v' ’  '  }  (X1  ,X2' "  '  5  =  /71{x)v/?2!x2)V' 


(4)  For  any  X  e  Ob  (SET)  and  nil  and  <7  e  H  =  HX  , 

V 

define  D(y)  «  H  by,  for  all  x  e  X  ,  the  diagonal  part  of  q  at 
x  , 

D(/7 )  (x)  =  17  (x,x,  •  •  •  ,x)  , 

and  in  turn,  define  for  any  ( Xn ,  17 )  e  Ob  (Fuz(H))  , 

Xn 

D  ( Xn ,  #7  )  =  (X,D(i7  )  )  «  Obx-(Fuz(H))  . 
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Thus,  for  any  n  >  1  and  q ^,q 2> . • . ,q n  e  H  > 
D(  (X,^  )xa-  •  •  xA(X,/7n)  ) 


=  (X,/71a-  •  •  A*7n)  ; 

D(  (X,i71)tv.  •  •  tv(X,/7n)  ) 

=  (X^v-  •  •  v/7n)  , 

(5)  It  follows  readily  that 

(i)  For  any  X  e  Ob(SET)  and  any  Aj,...,An  «  »(X)  ,  with 


❖x( Aj)  54  (X.^  >  . 


where  <}>B  denotes  the  membership  function  for  B  «  ?(X)  .  i-e. 


1  iff  x 

€ 

B 

V*)  - 

0  iff  x 

B  , 

for 

all  x  «  X 

,  and  any 

B 

0 

3  =  i.. 

.  .  ,n  , 

then 

(X.<t>  )  A  *  • 

•a(X,<J>a  ) 

= 

(X, 

■*A  A--- 

a*A  } 

1 

n 

1 

n 

= 

(X, 

nA  ^  ’ 

l 

n 

(X,<t>,  )v-  • 

•WX.*  ) 

= 

(X, 

'^A.U- • • 

UA  *  ' 

1 

n 

1 

n 

c(x.«>A  )  = 

Ai 

<x,c<t>A  ) 
A1 

— 

(X, 

i-n  ) 
*1 

=  (X,<J> 

CA1>  • 

(ii) 

Similarly 

for  any 

XJ 

€ 

Ob (SET) 

and 

Aj  S  ! 

3  = 

1 # . . . »n  / 

<X!'V>xa 

•••w 

*A 

)  = 

»  (XjX-- 

•xXn,4> 

A.  x  •  •  • : 

1 

n 

i 

(Xl'\)fv 

W 

*A 

n 

)  = 

■  (XjX • • 

xXn'^ 

Aar... 

(6)  If  H  has  additional  structure,  other  operations  in  addition 
to  the  ones  above  may  be  defined  over  Fuz(H)  which  in  a  natural 
way  extend  ordinary  set  operations.  In  particular, 

if  H  =  [0,1]  ,  we  can  let  :  H  x  H  -►  H  be  any  t-norm,  with 
$or  :  H  x  H  H  any  t-conorm,  and  <p  :  H  -►  H  be  any  negation  - 
or  even  more  generally,  could  be  any  conjunction  operator  and 
$  any  disjunction  operator. 


(7)  One  approach  to  a  relation  on  0b(Fuz(H) )  which  extends  ordi¬ 
nary  subset  inclusion  is  to  define  for  any  X  €  Ob(SET)  any 


Symbolization  and  Evaluation  of  Language 


217 


71./72  e  H 


iff 


iff 


(  (X,<7  j)  SH  ( X ,/7 2  )  ) 


/7a  e  Sub(X,<72 ) 


/7 ^  s  /7 j  pointwise  over  X  ,  i.e.. 


01{x)  /72(x)  ;  for  all  x  «  H  . 

Thus,  for  any  A1<A2  6  *(X) 

(X.*A  )  sH(X'<i >A  >  lff  Ai  S  A2  * 

Define  Ar(Fuz(H))  as  follows. 

For  any  (X^^)  ,  (X2,/?2)  6  Ob(Fuz(H))  ,  let 

Ar,v  „  \  iv  „  \ (Fuz(H) ) 

X1  1  '  X2  '^2 

=  {g  |  g  :  XlXX2  -  J  & 


(FI) 

(F2) 

(F3) 


( g(  •  *  ■  • )  *  ^7 a  ( •  )  a  ^7 2  ( ' ' ) ) 


v  g  ( •  ,x2)  =  i?1(- )  & 

g(  •  ,  •  •  )  a  g(  •  ,  •  •  •  )  s  a . ) 


X2€X2 


where  we  use  the  Kronecker  delta  notation 


a  =  f  1  ,  if  x  = 
x,y  l  0  ,  if  x  * 

and  alternatively,  8  (X) 

x ,  y 

are  restricted  to  be  in  space 


y 

y 

is  used  to  indicate  that 
X  . 


x 


It  follows  from  (F2)  and  (F3)  for  any 
g  €  Ar/V  )(Fuz(H))  ,  there  exists  a  unique 

l  Aj/7  )  ,  l  *2  ,r‘  2 

hg  :  (X,  H  711(  0))  -»  X2 ,  such  that  for  all  x1  e  X1  H  /7 1 1  ( 0 ) 
/71(x1)  >  0  , 


V  g(x1 ,x2)  =  g(x1,hg(x1))  =  ^(Xj) 
x2  €  X2 


and  y 


i.e., 
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and  for  all  x 2  e  X2 

g(x1<x2)  =  «hs{Xi)>XaA  . 

But  (FI)  implies  that  at  x2  =  hg(Xl)  , 

g(x1,hg(x1))  =  <;®"  Jx  )  S^rj  |x  )  a 2n  £h  £x  )  ) 

whence 

n1(x1)  S  /72(hg(xl)  }  ' 

for  all  x^  s  H  /7 ~ 1  { 0 )  . 

This  thus  implies  trivially  that  for  all  xi  €  X  H  /7 ~ 1  ( 0 )  and 
all  x2  e  X 

g(xi-x2)  =  8h  (Xl),x2A  ’i^i1  A  W  • 

On  the  other  hand,  if  ^(Xj)  =  0  ,  then  (F2)  and  (F3) 

imply  that  for  all  x2  s  X2  , 

0  =  g(x1(x2)  . 

Hence,  if  we  (arbitrarily)  extend 
ha  :  xi  -  x2  • 

it  follows  that,  for  all  Xl  e  X1  ,  x2  €  X2  , 

g(Xj  ,x2 )  =  ahg(X;t)fx2  A  ,71(X1)  A  ,?2(x2)  • 

Note  also  that  it  follows  that  for  all  Xj  €  Xj  , 

/71(x1)  s  /72(hg(x1)  )  . 

Conversely,  if  h  :  X^^  -♦  X2  is  arbitrary,  define 

g(h)(,'-*)  =  6h(.),*-A'7l(')A/72(")  • 

Then  it  follows  (see  Eytan  [56],  p.  59,  Proposition  3.3.1) 

that 

g(h)  *  Ar(X1,/71),(X2,72)(Fu2(H)) 

iff 

'71(x1)  S  /72(h(x1)  )  , 

for  all  x^  «  Xx  . 

Hence,  in  summary: 

Ar(X1,/71),(X2,/72)(FuZ(H)) 

=  (8h(-),--A  °1{')  A  r’2{")  I  h  :  X2  -»  X2  &  rj1{‘)  *  /7  2  ( h  {  -  )  )  } 
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=  n1i-)  I  h  :  X1  -  X2  &  *i(->  5  n2(h(-))}  , 


Ar ( Fuz (H) )  =  U 
X1 

«  Ob(SET) 


X1'X2 


Ar  >  y  _  .  ,v  „  . (Fuz(H) )  . 

(  a  ^  ,  ^7  ^  )  *  ( X2  '  ^7  2  ' 


B.  A  connection  between  Ar(Fuz(H))  and  extension  naps. 

Let  <}>&  be  any  fixed  t-norm,  such  as  <|>&  =  a  and  <f>Qr 

t-conorm  such  as  „  =  v  .  Then  define  the  extension  or 

or 

operator 

#  :  Ar { SET )  -  Ar ( Ob ( Fuz ( H) ) )  , 
where  for  any  ,  X2  e  Ob  (SET)  ,  and  any  h  :  Xj  -*  X2 , 

#(h)  :  0bv  ( Fuz ( H ) )  -  0bv  (Fuz(H))  , 

X1  X2 

X-1 

by,  for  any  rj  ^  e  H 

(#(h))(X1,^1)  ^  (X2.#(h){i71)) 

X2 

where,  #(h )  (rj  ^  «  H  is  given  by,  for  any  x2  e  x2  : 

((♦(h))(i71)){x2)  ^  <Por  (^(x))  *  H  . 

Xj  «  h-1(x2) 

This  implies 

(i)  If  x2  €  rng(h)  ,  h  1  [x^  -  0  and 
((*(h))(i71))(x2)  =  0  , 

(ii)  For  all  x1  e  xi  , 

'71(x1)  £  (  (#(h)  )  {rj1 )  )  {h{x1 )  )  , 


(iii)  For  any  c  e  H  ,  x^  e  x^ 


(#{h,>{0x1,.A  c)  =  ah(x1),.-A  c 


(iv)  Thus,  using  (iii)  in  the  definition,  for  any  x2  e  X2 


((#(h))(/7,))(x_)  =  <f> 


or 
X1  6  X1 


(Sh(Xl),x2  A 


be  any 
lifting 
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<t>or  ((#(h))(ax  _  a  /71(x1) ) )  (x2)  , 

xi  6  xi  r 


(#(h))(i71)  =  *or  ((#(h))(«x  (<  a  i/jtxj))) 
*1  *  *1  '' 


( pointwise ) 


*  8h(-),-.  A  V'1  • 


Hence,  if 


(X2,/;2)  =  #(h)((X1.i71)) 


^(Xj)  £  rj2(h(x1))  ;  all  x1  e  X1 


n2{")  *  *h(.),--  A  *1(*>  €  Ar(Xll7l,(X2,72)(Fuz<H))  '  etc’ 


C.  General  compositions  of  binary  Mappings. 

Let  <}>&  and  $Qr  be  any  fixed  t-norm  and  t-conorm. 

Let  X2  ,  X2  ,  X3  e  Ob ( SET )  be  arbitrary  with  f  :  X^^  x  X2  -»  H 
and  g  •.  X2  x  x3  -♦  H  also  arbitrary.  Then  define  composition  or 
relational  product  mapping  gof  :  X^  x  Xg  -*  H  by,  for  all 

X1  €  X1  '  x3  €  X3  ' 

(g°f )  (Xj  ,Xg)  =  <for  (4)&(f  (x1(x2)  ,  g(x2,Xg))  . 

X2€X2 


In  particular,  note  that  if  h  :  X2  -*  Xg  . 


f  ( *  ,  *  •  )  *  n  ( • • )  and  g {••,••• )  =  a 


h(  •  •  )  , 


(g°f)  (•  .•  •  • )  =  #<h)  (n)  (•  •  • )  , 


It  follows  that,  if  $jc.  is  distributive  over  *  ,  then  ° 

ot  T  or 

is  associative,  as  is  the  case  for  (a,v)  or  (A,bndsum).  In  any 
case,  for  any  X1  ,  X2  and  any  f  :  X1  x  X2  -*  H  ,  letting 

X1  X2 

q  €  H  ,  e  H  be  arbitrary  such  that 

((■'••)  *  n  •>  ( •  )  A7„(--)  (we  can  always  choose  7 .  ( •  )  =  ( •  •  )  =  1)  , 
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then  nl(-)  A  a>j..(X1)  :  X 1  x  X ^  H  and  if2(--)  a  «..  ...(X2; 

X2  x  X2  ^  H  act  as  (nonunique)  local  identity  maps  relative  to 
i  .e.  , 


£*7 !  ( *  )  A  5.  .  (X!>  ) 


h<-->  »  ] 


o  f  =  f  . 


In  particular,  define  composition  o  for  Ar(Fuz(H))  as  above 
using  a  and  v  .  It  follows  that  °  is  indeed  associative  and 

for  all  f  «  Ar(X1,/71)  ,  (X2,/72)  (Fuz(H)  }  A  and 


/7  2  ( ‘  '  )  A  s.  ( X2 )  act  as  simultaneous  local  identity  arrows. 

It  follows  that  Fuz(H)  =  ( 0b( Fuz ( H) )  , Ar (Fuz (H)  )  is  a  cate¬ 
gory.  Note  that  explicitly,  if  f  €  Ar.v  „  ,  ,v  „  , (Fuz(H))  and 

^1*^1  9  A2  #/^ 2 

g€  Ar(X2,/72),(X3,/73)(Fuz(H))  '  ^en  since 
f  =  °hf  ( •  )  ,•  -A  nl(') 

for  some  hf  :  Xj  -  X2  ;  q  x  ( •  )  S  i72(hf(*))  .  and 


g  =  o 


.  .  .A  /?,(••) 


for  some  hg  :  X2  -*  X3  ;  rj  2( ••)  £  n3(hg{  -  •  )  )  ,  then  for  all  x1  e  xi  , 


x3  €  X3  ' 


(S,.f)(x1.x3)  -  v  ,,x  A  A  «h  , 

x2  ^  a2  *  a  46  g  ^  o 

=  8h  (h.(X,)),X  A  A  ^2  ^hf (Xi >  ) 

g  r  l  3 

=  ahg(hf (Xj) ) ,x3  A  ' 


n  x(-  )  s  /72!hf(-))  < 


and  hence  as  a  check. 


9°‘  *  4r(X1.71),(X3,,3||ruz(H>> 


We  summarize  some  of  the  basic  properties  of  Fuz(H)  : 
For  any  (X^.n^)  «  Ob(Fuz(H))  ,  i  =  1,2,...  . 
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Cartesian  products:  x  {X^.rj  =  (  x  X^  ,/7 ^  ( •  )  a/72  ( •  •  )  a- •  •  ) 

1=1 ,2, . . .  i=l ,2, . . . 

Power  class  and  subobiect  class  (up  to  Isomorphisms): 
»(X1,I71)  =  {(Xj.r)  I  re  Sub^,^)) 

where 


Sub(X1,/71)  =  (y  I  (X1<?)  e  Ob(Fuz(H))  and  r  £  /71(pt-wise) 
For  any  f.g  «  Ar,y  , v  , (Fuz(H))  : 


Equalizer:  ■»  .„  ,  ,y  _  ,  f  _  -  (X,.  v  ( f  ( •  .  y )  Ag  ( •  ,  y )  )  )  . 

i l  ' {X2‘n2' ,T,g  1  y«X- 


Monic :  f  is  a  monic  iff 

f(x’,y)  a  f(x”,y)  =  o(x,/x")  a  f(x’,y) 
for  all  x‘,x"  e  Xj^  ,  y  «  X2  . 


Epi :  f  is  an  epi  iff 

n o(Y)  =  v  f (x,y)  ;  all  y  e  X.  . 

X€X1 

Initial  object:  0FuZ(H)  “  (#<0) 

Terminal  object(s):  TFuz(H)  =  ((x),TH) 

where  (x)  is  any  singleton  set  and  as  usual  TH  is  the  maximal 

element  under  s  of  H  .  (See,  e.g.,  Eytan  [57].) 
n 


Pullbacks :  Let  f  e  Ar 


(X,r) , (Z,e ) (Fuz(H) ) 


g  «  Ar/V  .  a,(Fuz(H))  .  Then  the  following  commuta- 

1 1  )  I  \  i  I*  ) 

tive  diagram  holds 


pro  j 


( W,  x  ) 

I 

(X,y) 


(n 

(V.17 ) 


(Y,/7) 

i g 

(2,0) 


where  pullback  c  =  ( X,r )  x  ,  and  W  =  X  x  y  , 

f-Q 
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§  V  <  f  (•  ,z)  A  g(. .  ,z))  , 
Z  f  Z 


with  e.g.,  first  projection  determined  by 


P-OjJ*!,}  (',••/  x)  -  it*,'-)  a  ?(•  !  Al?(x)  A<7(-<)  A  0  ^ 

for  all  x  e  X  ,  where  a  is  the  Krtinecker  delta  function. 


Independently,  Goguen  developed  category  theory  connections 
with  Fuzzy  sets  [963 • 

Specifically, 


where 


Gog (H)  *  (Ob(Gog(H) ) ,  Ar{Gog(H) ) , 


Ob(Gog(H) )  *  Ob(Fuz(H) )  , 


and  it  follows  immediately  from  the  above  results  that,  for  any 
(Xj.Oj  3  .  <X2,/72)  e  Ob(Gog(H)  )  , 

Ar(X1,'71).  <X2'n2)  (GOg(H)  5 

*  {h  |  h  :  Xj  Mjl')  i  J72(h{-  ))} 

may  be  identified  with  Ar.v  .  ,v  .(Fuz(H))  ,  where  composition 

( • v ^ / # \ ^2^ 2 

here  is  ordinary  functional  composition  and  identity  maps 

id(X  /7 )  s  *dx  '  *,e’*  ordinary  identity  maps.  In  addition,  for  any 

( X , i? )  e  Ob(Gog(H) )  ,  the  class  of  all  category  subobjects  denoted 

9(X,rj)  ,  analogous  to  ordinary  power  class  and  fuzzy  class  nota¬ 

tions,  is  isomorphic  to 

{{K' ,g')  l  X'  S  X  &  rj  ’  S  q  pointwise  over  X’}  . 

Thus,  order  for  H  Induces  an  order  over  9(X,g)  • 

Consider  next  the  following  6  properties  which  any  given 
category  c  =  (Ob(c ) ,Ar(C ) )  may  or  may  not  satisfy: 

(Gl)  c  has  initial  and  terminal  objects  called  ef  and  T_  , 

c  c 

respectively 

(G2)  C  has  associative  images,  i.e.,  for  Ai'A2'A3  «  Ob(C)  ,  and 
f  *  A'Aj.A,"'  •  3  *  ArA2,A3(C!  ' 


gtfCAj))  =  (g° f ) ( Aj )  , 
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where  image  f  ( A  ^ )  is  defined  to  be  the  smallest  -  through  func¬ 

tional  composition  -  subobject  relative  to  f  and  A^^  ,  etc. 

(G3)  C  is  disjointly  a  complete  distributive  lattice.  That  is: 

for  each  A  e  Ob(C)  ,  9(A)  ,  the  class  of  all  subobjects  of  A  ,  is 
a  complete  distributive  lattice  ordered  according  to  subobject 
orderings,  i.e.,  composition  with  v  defined  here  as  a  least  upper 
bound  operator  and  a  as  a  greatest  lower  bound  one.  In  addition, 

noting  0C>TC  €  9(A)  ,  for  any  B,C  e  9(A)  ,  if  B  *  C  and  B,C 

are  atoms,  i.e.,  for  any  D  e  *(A)  ,  if  a  S  D  S  B  ,  either  D  =  0 

c  c 

or  D  =  B  ,  and  similarly  for  C  ,  and  noting  also  that 

CB  -  v  {x  |  x  e  9(A)  &  x  a  B  =  0  }  «  9(A)  and  similarly  CC  e  9(A)  , 

c 

then  CB  v  CC  =  T,  . 

c 


(G4)  For  any  y.  «  9(A)  ,  v  e  Ar  (C)  ,  j  e  j  ,  j  any  index 

J  J  Aj'A 

set,  where  if  i ,  j  «  J  ,  i  *  j  ,  then  A.  a  A.  =  0  ,  the 

1  9(A)  3  ° 


disjoint  union 


v  y  e  9(A)  and  the  latter  may  be  considered 
j  €  j  J 

also  as  the  coproduct  of  all  injections  y^  from  Aj  into  A  , 

j  e  J  .  Conversely,  any  coproduct  (or  cartesian  sum)  of  injections 
in  c  may  be  considered  a  disjoint  union  of  the  injections. 


(  G5 )  C  has  an  atom i c-mon i c ,  generator  P  which  is  projective, 
i.e.,  P  e  Ob(c )  is  such  that 

(i)  For  any  A  €  Ob(C)  ,  Arp  ft(C)  is  a  (nonempty  for  A  *■  0 ) 

collection  of  monomorph  i  sms .  such  that  for  any  y  e  9(A)  ,  with 

y  e  Arp  A(C)  ,  y  is  atomic. 


(ii)  For  any  A, B  e  Ob(c)  and  f  *  g,  f,g  e  Arft  B(c )  ,  there  is 
m  e  Arp  A(C)  such  that  f®m  *  g«m  . 


(iii)  For  each  A,B  €  Ob(C)  and  any  f  €  Arft  B(C)  ,  and 
m  «  Arp  B ( c )  .there  is  an  m'  e  Arp  A(c)  such  that 

f°m'  =  m  . 

(G6)  For  any  P  €  Ob(C)  ,  P  +  P  ,  the  co-product  of  P  with  itself 
is  not  isomorphic  to  P  . 


Theorem  ( Goquen  [96],  Carrega  [29].) 

For  any  completely  distributive  lattice  H  and  hence  for  any 
Heyting  algebra  H  ,  Gog(H)  satisfies  properties  (Gl)  -  (G6).  In 
conjunction  with  this,  Gog(H)  is  a  pseudotopos,  i.e.,  it  satisfies 
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properties  (i)  -  (iv)  (but  not  in  general  necessarily  (v))  for  the 
definition  of  a  topos  in  Appendix  1  (9)(xiii). 

(a)  The  initial  object  0  for  Gog(H)  is  given  by, 

Gog(H) 

0  =  (<j>,0) 

Gog(H) 

which  has  the  basic  property  that  for  any  (Y ,7)  e  Ob(Gog(H))  , 


r0Gog(H)'(Y^)(GO9(H))  = 


where  fv  :  0  -*  Y  is  symbolicly  defined. 

(b)  The  terminal  object  T  for  Gog(H)  is  given  by 

Gog(H) 


Gog(H) 


=  ( {x} ,Th) 


where  x  e  X  e  ob(SET)  is  arbitrary  fixed;  all 
isomorphic;  and  for  any  (Y,N)  6  Ob(Gog(H))  , 


Gog(H) 


,TGog(H) 


(Gog(H))  =  {f(y#J7)> 


where  here  for  all  y  e  Y 
f  (Y)  =  x 


(c)  Gog(H)  has  all  products  existing,  where  for  any 
(Xj.,j)  e  Ob(Gog(H) )  ,  j  e  J  ,  J  any  index  set, 

x  ( X  . ,  17  . )  is  isomorphic  to  (  x  X. ,  a  17  .(•.))  . 
jej  3  J  j«j  3  jej  3  3 

(d)  Let  (Xj,/7j)  «  Ob(Gog(H)  )  ,  j  e  J  .  Thus,  w.l.o.g. 

(X.,7.)  €  »(X,7)  for  X  =  X  X.  ,  7  =  v  q  pointwise.  Then  the 

3  J  jej  3  jej  3 

category  definitions  for  unions  and  intersections  of  (X^.q^)  ,  j  e  j 

in  terms  of  least  upper  bounds  and  greatest  lower  bounds  for  the 
order  induced  on  9{X,q)  coincides  with  the  Zadehian  definitions 


u  (X.,7  . ) 
j  e  J  3  3 

n  (X.,7.) 
jej  3  3 


(  u  X., 
j  €  J  3 


VH  *j> 

j  «  J 


(  n 
j  « 


ah  7j)  • 
€  J 


Similarly  for  C(X,7)  =  (X,Cjj7)  complementation.  U,7,C  all  have 
the  usual  properties. 
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(e) 

(up 


An  atomic  projective  generator 
to  isomorphic  uniqueness)  for  any 

( (x) ,0)  . 


P  for  Gog(H) 
x  e  X  e  Ob (SET) 


is  given  by 


Theorem  ( Goguen  [96]) 

Let  c  be  uny  category  satisfying  properties  (Gl)  -  (G6). 

Then,  up  to  isomorphism,  there  exist  a  unique  completely  distribu¬ 
tive  lattice  H  such  that  Gog(H)  and  C  are  equivalent  in  the 
sense  that  there  is  a  functor  F  i  C  Gog(H)  such  that  for  all 
A ,  B  €  Ob ( C )  ,  (F|Arft  B(C))  is  isomorphic  to  Ar FjA)  FjB)(Gog)(H) 

and  for  any  (X,^)  e  0b(Gog(H))  ,  there  exists  C  e  0b(C)  such  that 
F(C)  is  isomorphic  to  ( X,rj )  . 

Stout  [249']  has  pointed  out  that  (Gl)  (G2J  (G4)  (and  (G6)  in 
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a  trivial-like  sense)  hold  for  all  topoi,  whereas  (G3)  and  (G5),  in 
general,  do  not. 

Another  property  of  Gog(H)  that  is  desirable  is  the 
determination  of  pullbacks.  It  follows  by  using  the  property  of 
pullbacks  for  Fuz(H)  in  2.4.2  (C)  and  the  representation  of  Gog(H) 
in  Fuz(H)  (via  its  arrows)  in  2.4.2  (F)(2),  that  we  have  the 

following : 

For  any  f  e  Ar (X|y) ( (z ,9  ) (Gog(H) )  , 


g  «  Ar(y  q)  (2  9)(Go 9(H))  ,  the  following  commutative  diagram 


holds 


e  =  (w,* 


(x,y ) 


proj|« > 

P  J(Y,/7) 


i g 

(Z,9  ) 


where  pullback  C  =  (X,Y)  x  (Y,q)  , 

f  .9 

where 

W  =  X  x  y 

*(x,y)  =  tp  ( x)  A/7(y)  a  e(f(x))  a  a(f(x),g(y))  , 
for  11  x  e  x  ,  y  e  Y  ,  and 

proj!x!y)  =  pr°jxiw) 

In  particular,  if  g  e  Sub(Z,e)  ,  we  may  write  (from  2.4.2  (D)) 
w  .  1  .  o  .  g  . 


Y  =  Z  ,  g  =  id 


Z  ' 


q  S  9 


and  define 

Sub ( f )  :  Sub ( Z , 0 )  -  Sub ( X , Y ) 

by,  for  all  x  <=  X  , 

Sub ( f ) (g) (x)  =  k (x,f (x) ) 

=  ?(x)  a  q  (f (x) )  a  0  (f (x) ) 

=  'F(x)  a  q  (f(x))  . 

Hiqq's  topos  Hlqg(H)  . 

Higgs  developed  a  category  connected  with  sheaf  theory  [59], 
First  define 

Higg(H)  =  (ObfHigg(H))  ,  Ar ( Higg ( H ) ) )  , 
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where:  for  any  X  e  Ob (SET)  , 

Obx(Higg(H) ) 

=  )  I  9  :  X  x  X  -*  H  & 

(HI)  e  is  symmetric, 

(H2)  90s  s  g  . 

i  .  e .  , 

e(x,y))  a  s { y , 2 )  s  8(x,z),  all  x,y,z  e  X} 

Thus,  by  setting  x  =  z  and  using  symmetry,  it  follows  that 
for  all  ( X , 8 )  e  0bx(Higg(H))  , 

(1)  8  (-,••)  <  (D(e) (• )  a  D ( 8 ) ( • • ) )  , 

(ii)  (X,D(8 } )  e  0bx(Fuz(H))  , 

where  we  recall  the  diagonalization  D(8)  of  8  is 

D(c)(x)  =  e(x,x)  ,  for  all  x  e  x  . 

Also,  for  any  (Xa  ,  8  1 )  ,  (X,,  ,9  2 )  e  Ob(Higg)  , 


Ar(X1,ei) , (X2,82)(Higg) 

=  [f  I  f  :  X1  x  X2  -  H  & 


( H3 ) 

i  .  e .  , 

f  08  S  f  ,  8  2of  5  f  , 

1"  8  1  (Xj  ,X1  )  A  f(x1,x2)  S  f(X^,X2) 
|  f(xa,x2)  A  02(X2,X^)  S  ffXj.Xp 

for  all 

xi'xi  4 

’■  Xx,  X2'X2  €  X2  '  & 

( H4 ) 

i  .  e  .  , 

T 

fTof  S  02  , 

f(xa,x2)  A  f ( Xj , x2 )  s  e2(x2,x^) 

for  all 

Xx  e  Xj,  x2,yL2  €  X2  & 

( H5 ) 

V 

X2  € 

f(-  ,x  )  =  D<8  )(•  )  , 

X2 

(H6)  f  (  •  ,  •  •  )  S  D(8  j ) { •  )  A  D ( 8  2  )  ( •  •  )  j  • 

Composition  of  arrows  for  Higg(H)  is  the  same  as  introduced 
earlier  for  f  :  X^  x  X2  -*  H  ,  g  :  X2  x  X3  -►  H  ,  etc.  The  identity 
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arrow  under  composition  is,  for  any  (X,e)  e  Higg(H) 
id(x,0)  -  9  :  X  x  X  -♦  H  . 


Theorem  (Fourmann  and  Scott  [59],  [226']). 

Higg(H)  is  a  Grothendieck  topos  equivalent  to  the  category  of 
sheaves  over  H  ,  with  subobject  classifier  fi  being 

a  =  (H,1h),  1H  :  Hx  H  -  h  ,  lH(x,y)  s  1H  . 


We  can  write  symbolically  (see  Johnstone  [127],  pp.  8  -  12) 

H°P 

Higg(H)  =  Sh° ( SEx  )  , 

H°P 

where  Sh  =  r L  is  the  associated  sheaf  functor  and  SET  is  the 

category  of  all  presheaves: 

First,  Identify  Heyting  algebra  H  as  a  category  where  Ob(H) 
=  H  and  for  any  hj ,  h2  «  H  , 

0  ,  if  hj  >jj  hg 

(h^hj)  ,  if  hj  h2  , 

where  »  =  =>H  is  the  implication  operator  for  H  (see  section 
2.3.3  (C)(3)). 

Then 


Ar  h  (H)  S 
ni  ,n2 


where 


H°P  H°P  H°P 

SET  =  (Ob(SET  )  ,  Ar(SET  )) 


H°P 

Ob(SET  )  =  (F  |  F  :  H  -»  SET  is  a  contravar iant  functor) 


H°P 

and  for  all  F , G  «  Ob(SET  )  , 


;°P 


ArF,G^SET  )  =  (9  I  g  :  F  - ►  G  is  a  natural  transform) 


Theorem  (Pitts  [203]) 

For  any  (Xj,e  )  6  Ob(Higg) ,  j  =  1,2, 

f  *  Ar(X1,.1).(X2,.2)'HI^<ii>>  ■ 


and 
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T 

(a)  f  is  a  monomorphism  iff  fof  s 

(b)  f  is  an  epimorphism  iff  v  ffx,,* )  =  D(e  )(•) 

X  6  1 


(c)  f  is  an  isomorphism  iff  f  is  a  mono  and  epi . 

The  subobjects,  terminal  objects,  initial  objects,  etc.,  for 
Higg(H)  are  completely  analagous  to  those  for  Fuz ( H ),  (See  also 
Eytan  [57]  for  a  listing  of  various  properties  of  Fuzx(H)  (see  the 

next  subsection  F)  which  in  turn  can  be  directly  translated  in 
terms  of  Higg(H).) 


Now  consider  Higg's  category  and  two  point  coverage  functions 
(see  Chapter  5).  This  is  a  special  case  of  the  problem,  given  any 
f  :  *n(X)  -  [0,1]  , 

where 


»n(X)  =  (A  |  A  s  X  &  card(A)  s  n}  , 

for  some  fixed  integer  n  ,  to  determine  which,  if  any,  random 
subsets  S  of  X  are  such  that  their  n  point  coverage  functions 
match  f  ,  that  is,  for  all  A  e  *n(X)  , 

f ( A )  =  Pr ( A  £  S) 

Clearly,  when  n  =  1  ,  we  have  the  well  investigated  one  point 
coverage  problem  for  fuzzy  sets.  When  n  =  2  and 

f  «  Ob ( Higg ([0,1]))  (letting  once  more  H  =  [0,1]  from  now  on), 
this  reduces  to  the  two  point  coverage  problem  for  Higg  objects. 
Clearly,  for  any  random  subset  S  of  X  and  any  x,y  e  X 


(1) 

max(Pr (xeS) +Pr (yeS) -1,0)  s  Pr((x,y)  £  S )  s  min(Pr(xeS) ,Pr(yeS) )  , 

noting  that  upper  and  lower  bounds  also  coincide  with  the  maximal 
and  minimal  copulas  possible  with  respect  to  the  possibility 
function  Pr(-«S)  evaluated  at  x  and  y  .  (See  also  section 
2.3.6  on  t-norms,  t-conorms  for  similar  bounds  and  Chapter  6.) 

First,  let  us  remark  that  in  general  it  is  not  true  there 
exists 

f  :  [0,1]  x  [0,1]  -  [0,1] 

such  that  for  given  random  subset  S  of  X  ,  for  all  x,y  e  X  , 

(2)  Pr( (x,y)  £  S)  =  f(Pr(x  e  S)  ,  Pr(y  «  S))  , 

unlike  the  analogue  in  Sklar's  theorem  for  probability  distribution 
functions  [226]:  Reconsider  the  above  relations  in  terms  of  zero- 
one  random  variables  representing  whether  a  point  is,  or  is  not, 
covered  by  S  and  show  for  two  distinct  sets  (x,y)  ,  (x'.y1)  £  X  , 


Pr(x  €  s )  —  Pr(x'  c  s)  ,  Pr(y  e  s)  =  Pr(y'  e  s)  , 
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but 

Pr( {x,y}  s  S)  *  Pr { {x ' , y ' }  £  S)  . ) 

However,  for  certain  random  subsets,  this  is  not  only  true,  but  f 
is  also  a  t-norm.  For  random  sets  of  the  form  S ( A ; <f>& )  where  here 

A  s  A  ,  a  single  fuzzy  subset  of  X  ,  for  any  nil  and 

x,,...,xn  e  X  , 
l  n 

Pr({x1 . xn)  £  ) 

=  <f>&(Pr(x1  €  S(A;<t.&),-,Pr(Xn  €  S(A;<t>&))) 

=  ♦a(Vxi)-,,’1Vxn))  * 

(Special  cases  of  this  random  set,  include  Sy(A)  =  [U,l]  ,  for 

<}>&  =  min  and  T(A)  ,  for  <|>&  =  prod.  See  Chapter  6.) 

Because  of  eq.  (1)  ,  eq.  (2)  may  have  no  solution  in  S  , 

depending  on  choice  of  f  .  For  example,  if  we  choose  the  Higg's 

object  (X,f)  ,  where  D(f)  is  arbitrary  over  diag(XxX)  ,  but  such 

that  there  are  at  least  two  distinct  points  x_,y  e  y  such  that 

o  o 

f(xo'yo)  +  f(yo'Yo)  -  1  >  f ( *o • yo >  *  0  ' 
then  eq.  (2)  has  no  solution.  Or,  if  X  is  discrete,  let 

f (x,y)  =  >  0  if  x  “  y 

.  =  0  iff  x  *  y  , 

such  that 

(3)  z  f(x,x)>l. 

X  €  X 

Thus,  if  an  S  satisfies  eq.(2),  then  for  any  x  *■  y  , 

0  =  Pr  ( (x,y)  £  S)  , 

which  implies  S  =  {V}  ,  V  a  random  variable  over  X  ,  but  letting 

x  =  y  arbitrary  in  eq.  (2),  eq.  (3)  yields  a  contradiction. 

The  following  positive  result  does  obtain: 

1.  Given  any  space  X  and  a  random  subset  S  of  X  ,  there  is  a 
copula  such  that 

(4)  (f  (x,y)  ,  f  (y,z)  )  s  f(x,z)  , 

for  all  x,y,z  e  X  ,  where  for  all  x,y  e  x 
f (x,y)  =  Pr ( {x,y}  s  S)  . 
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2.  Given  any  space  X  and  =  min  ,  a  solution  class  of  f's 

satisfying  (4)  is  given  by 

f  =  Vi  A  V"1  ' 

for  all  fuzzy  subsets  A  of  X  .  Note  that  since  min  is  the 
maximal  t-norm,  any  solution  of  (3)  for  =  min,  i.e.,  any  Higg’s 

object  (X, f )  (assuming  f  symmetric)  must  also  satisfy  eq.  (3)  , 
for  faut  not  conversely. 

3.  Given  any  space  X  and  $5  ,  any  strict  Archimedean  t-norm 

[140]  (such  as  prod)  then  the  solution  class  of  all  f's  satisfying 
eq . ( 3 )  is 

(f  |  f  =  h-1°d  ,  d  is  any  pseudometric  over  X)  , 
where  °  is  ordinary  function  composition  and 

4>&(x,y)  =  h-1(h(x)  +  h(y))  , 

h  :  [0,1]  R+  ,  h(0)  =  4-»  ,  h(l)  =  0  ,  h  decreasing,  is  the 

canonical  representation  of  .  (See  2.3.6.) 

Finally,  it  should  be  noted  that  Cerruti  [30]  has  investigated 
the  category  of  fuzzy  set  relations. 

For  some  set  theory  connections  involving  Higg(H)  ,  see 

subsection  G  of  this  section. 

Next,  we  will  consider  several  important  relations  between 
Fuz(H)  ,  a  class  of  modifications  of  Fuz(H)  ,  Fuz^(H)  ,  and 

Higg(H)  as  well  as  some  connections  with  Gog(H)  . 

For  more  general  constructions  of  hich  Gog(H)  ,  Fuz(H)  , 
Fuzx(H)  and  Higg(H)  are  special  cases,  see  Apendix  3,  at  the  end 

of  2.4. 


F.  Relations  between  the  basic  categories. 

We  present  a  list  of  relevant  results  connecting  Gog(H)  , 
Fuz(H)  ,  modifications  of  Fuz(H)  and  Higg(H)  ,  Some  of  these 
results  are  motivated  by  the  desire  to  determine  which  of  the  above 
categories  are  topoi  -  topoi  being  in  a  sense  kinds  of  generalized 
sets.  (See  also  the  discussion  in  (G)  following  this  subsection.) 
In  the  following,  H  is  some  fixed  Heyting  algebra. 

(1)  Fuz(H)  is  the  type  II  Benabou  extension  y  (SET,H')  of  the 

deduction  category  (SET,H’  )  with  the  proviso-.  *  as  developed  in 
the  formal  language  in  section  2.2.2  is  to  be  replaced  formally  by 
ordinary  equality  (via  the  Kronecker  delta  function.  This  shows  up 
in  the  construction  in  constraint  (F3)  ,  Appendix  3,  section  2.4.2. 
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(See  also  Remark  3  of  the  Appendix.) 

(2)  Carrega  [29]  showed  that  for  any  complete  chain  H  (and  H  is 
thus  a  Heyting  algebra) ,  Fuz(H)  and  Gog(H)  are  isomorphic  under 
the  isomorphism  functor  F  :  Gog(H)  -•  Fuz(H)  ,  where  for  any  (X,e)  , 
(Y,y)  e  Ob ( Gog ( H ) )  and  for  any  f  «  Ar^x  g  ^  (Y  ,v) < Gog(H) )  , 

F( (X,0 ) )  =  (X,0)  , 

F  ( f )  :  X  x  Y  H  is  such  that,  for  all  x  e  X  ,  y  e  Y  , 

F(f ) (x.y)  =  5  (Y)  a  0 (x)  a  *(y)  . 

f (x) , y 

It  follows  from  Appendix  3  (at  the  end  of  2.4.2),  (iii)  and 
Remark  3,  Case  (4),  that  for  H  ,  a  general  Heyting  algebra,  and  F 
as  above 


F  :  Gog(H)  -  Fuz(H) 


is  a  (non-faithful,  in  general)  functor. 

In  addition,  Stout  [249‘]  has  indicated  differences  in  struc¬ 
ture  between  Ar(Gog(H))  and  Ar(Fuz(H))  through  the  fact  that  for 
any  (X,/7)  «  Ob(Fuz(H))  =  Ob(Gog(H)  )  and  any  singleton  (x)  , 


=  (f  |  f  :  (x)  -♦  X  and 


=  x<x> 


rjof  k  0} 


but  by  (FI)  , 

Ar(<X>.0).(X.,7)<FuZ<H>>  =  <V  ' 


where 


f  :  (x)  x  X  -  H  ;  f  s  0  . 
o  '  o 

(3)  It  should  be  noted  (see  Appendix  3  (iv).  Remark  3)  that  Gog(H) 

is  a  type-I  Benabou  extension  of  the  deduction  category  (SET,H)  , 
again  as  in  (1),  with  *  interpreted  as  ordinary  equality.  Thus 

Gog(H)  =  f j ( SET , H’ )  . 

(4)  Next,  let  us  modify  Eytan’s  original  Fuz(H)  [56]  to 
Fuz>t(H)  ,  where  in  the  type  II  Benabou  construction  in  (1)  -  and  in 

particular  in  (F3)  -  ,  *  is  retained.  See,  e.g.,  Appendix  3  (iv) 
and  section  2.3.3  (C),  Godel-intuitionistic  logic,  for  the  proce¬ 

dure  for  generating  the  intuitionistic  Kronecker  delta  e0  from  any 

object  (X,8 )  in  Higg(H)  ;  in  turn  e0  defines  a  corresponding  = 

over  (X,0 )  .  Similarly,  we  can  modify  Gog(H)  to  be  GogK(H)  . 


(5)  Thus,  it  follows  from  Appendix  3  (Theorem  2)  that 
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Fx  :  Gog  £H)  -►  Fuz  £H)  where  F  x  is  the  same  as  F  in  (2)  but 
with  a  replaced  by  a  suitable  e8's  ,  is  a  faithful  factor. 

(6)  Eytan  claimed  in  [56]  that  Fuz(H)  was  indeed  a  topos,  because 

(mistakenly)  of  Theorem  2,  Appendix  3.  But  since  (SET,H)  ,  in 
effect,  due  to  *  being  forced  to  be  ordinary  equality  and  hence 
not  in  general  compatible  with  the  Th  ,.(z)  (see  2.2.2),  is  not 

sYnK'“ 

a  formal  topos,  y  ^  ^  ( SET , H’ )  need  not  be  a  topos.  In  conjunction 
with  this  problem,  the  following  results  are  pertinant: 

(i)  Carrega  [29]  showed  that  Gog(H)  is  a  pseudo  topos  (again, 
see  subsection  D) .  In  addition,  if  H  is  a  complete  chain  (with  at 
least  three  elements),  then  Gog(H) ,  and  hence  by  (2)  above,  Fuz(H) 
have  no  subobject  classifier.  (See  Appendix  1,  end  of  section 
2.4.3,  for  pertinent  definitions.)  Thus,  in  this  case,  Gog(H) 
and  Fuz(H)  are  not  topoi . 

(ii)  Ponasse  [204]  and  Pitts  [203],  in  effect,  showed  that: 
Fuz(H)  is  a  topos  iff  H  is  a  Boolean  algebra,  in  which  case 
Fuz(H)  is  equivalent  to  Higg(H)  . 

Note  that  H  =  (0,1)  is  a  (trivial)  Boolean  algebra  and  as  a 
check,  indeed  Fuz((0,l})  may  be  identified  with  the  topos  SET  . 
There  is  a  great  potential  for  applying  the  last  mentioned  theorem 
to  set-valued  (or  interval-valued)  logics,  i.e.,  logics  in  which 
truth  values  are  actually  sets  of  numbers.  (See  the  interesting 
work  of  Shoesmith  and  Smiley  [234’].) 

Pitts  [203]  concluded  from  this  analysis  that  Higg(H)  ,  not 
Fuz (H)  ,  is  the  appropriate  vehicle  to  "fuzzify"  SET  ,  noting  that 
Higg  objects  simultaneously  fuzzify  membership  and  a  kind  of 
equality. 

(iii)  More  directly,  and  apropos  to  the  previous  comments,  if 

(c,a)  is  a  formal  topos  then  (Appendix  3,  Theorem  2)  y^tc,*) 

must  be  a  topos.  In  particular,  for  (SET,H’ )  (see  comment  (4) 

above)  it  must  follow  that  (SET,H  )  when  *  is  interpreted  as 
ordinary  equality  {=)  ,  In  general,  is  not  a  formal  topos.  Indeed 

we  can  show,  through  a  simple  counterexample  to  deduction  rule 
RI  Th  24  f2,2,2)  that  this  Is  so: 

Choose  any  X,Y  e  Ob  (SET)  ,  <|>  €  HY  ,  y  e  Y  ,  x  «  X  , 

s,t  «  »(X)Y  ,  0  <  Ii4>(y)n  <  iixes(y)n  <  Hx  e  t(y)n  £  1  .  Then  the 

hypothesis  of  Rj  Th  24  will  hold: 


•i <t* ( y ) ii  a  lixes(y)ii  £  iixet(y)n 
n$(y)»  a  iix€t(y)n  £  iixes(y)n 


But  the  conclusion 

ii ( Y ) ii  5  Hs(y)  *  t ( y ) ii 
is  contradicted  since 
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0  =  Hs(y)  *  t{y)n  <  n<p(y)ii  • 

On  the  other  hand,  not  any  choice  of  *  will  work,  since  for 
*  being  interpreted  as  being  derived  from  eg  ,  e  e  Ob(Higg)  , 

by  choosing  e  =  a  ,  we  know  efi  =  a  ,  which  in  turn  yields  * 

interpreted  as  ordinary  equality.  However,  if  we  choose 
e  H  x  H  -*  H  to  be  of  the  form  8  (x, y)  =  rj  (x)  a  rj  ( y)  ,  for  all 
x,y  e  H  ,  then,  efl  will  indeed  generate  *  so  that  all  axioms  and 

rules  of  Th  are  valid  and  (SET,H*  )  is  indeed  a  formal 

synK , K 

topos,  as  is  seen  by  inspection.  In  turn,  this  choice  for  {SET,H  ) 
yields  y  (SET.H-)  =  Fuz><(H)  as  a  topos  by  Theorem  2,  Appendix  3. 

(iv)  Pitts  [203]  also  related  Fuz(H)  with  Higg(H)  in  the  fol¬ 
lowing  way: 

Define  the  categories 

5  Hi*>sub=onst1H>  5  • 

where 


H1Mconst<K>  i  0b^const'S>>  '  Ar<H1»eon.t' <H|  1 1  ' 
0b(Higgconst(H) )  i  { (X,8 (XxX)  |  Xe  Ob(SET)}  , 

)  . 

)  . 

{ ( X , 8 )  I  X  €  Ob { SET )  ,  8  €  Sub(X,8 (XxX) , 

i.e.,8  :XxXt*H  with 

e(x,y)  s  8  (XxX)}  , 

'  y 

U  Ar(X,8 ) (Y,f ) (Hlgg(H) 5  1 

(  X  ,  8  )  ,  (  Y  ,  7  )  1 

€  0blHiggsubconst(H))j 
Define  also  the  mapping 

G  :  Fuz(H)  -♦  Higg(H)  , 

where,  for  all  X  e  Ob(SET),  rj  e  HX 

G(  (X,rj)  )  S  (X,G(/7)  )  . 


0b(HiggSubconst(H))  = 

Ar(Higgsubconst(H) )  = 


noting  a(X)  is  the  ordinary  Kronecker  delta  function  over  X 

u 
K, 

€0b( SET) 

Higgsubconstv“'  *  ^-i'^-^^subconst'11^  ^  ,Ar^H:iggsubconst^H^  ^ 


Ar(Higgconst(H) )  -  U  Ar(Higg(X,8 (XxX) ) , (Y,S (YxY) ) (H) 

A  ,  Y 
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<5(9)  =  5(.  .  )  <x  *  x)  a  J7(.  )  , 

and  for  any  (X,n)  •  ( Y , C )  «  Ob(Fuz(H))  ,  and  any 
f  €  Ar(X,/7)  .  (Y.e  )  (Fuz(H)  >  ' 

G(f)  S  f  «  Ar0((x„)KG((yitl)(Hlgg(Hl|  . 

Note  G(rj)  €  Sub(X,8 (XxX) )  .  Conversely,  if 
f  e  Ar(Y,e ) , (X,a (XxX) ) (Hi99(H) )  ^  a  monic,  i.e., 

f  e  Sub(X,a (XxX) )  ,  for  some  (Y,e)  €  Ob(Higg(H))  ,  then  also  f  is 
an  isomorphism  from  (Y,8)  to  G(X,y)  ,  where  (X,1P)  e  Ob(Fuz(H))  , 
and  where 


*<• )  v  f (y. * )  • 

y«Y 

G  is  a  full  and  faithful  functor  imbedding  Fuz(H)  into  the 
subcategory  HiggsubcQnst (H)  of  Higg(H)  . 

Furthermore,  consider  the  mapping  D  :  Higg(H)  -♦  Fuz(H)  ,  where 
for  any  f  e  Ar(Higg(H) )  ,  D(f)  =  f  and  for  any  (X,e)  e  ob(Higg(H))  , 


D((X.«))  -  ( X , D ( 8 ) )  €  Ob ( Fuz ( H ) )  , 
D(8 )  =  8 ( • , ■ )  :  X  -  H  . 


Then  the  following  diagram  holds: 
For  any  (X,8)  e  Ob(Higg(H))  : 


G(D(X,8 ) ) 

6  °»(HiggEubco„st(H|) 


an  epi 


G ( D ( X , e ) )  ,  a  mono 


(X,a(x  x  X) ) 

€  Ob(Higgconst(H))  . 


(X,e  ) 


Thus,  (X,e)  is  a  quotient  object  of  G(D(X,8))  ,  which  in  turn 

is  the  domain  of  a  subobject  of  (X,8(XxX))  ,  Hence,  Higg(H)  is 

the  quotient  object  completion  of  the  subcategory  Hi99Subconst ^  ' 

which  may  be  identified  with  Fuz(H)  . 

(v)  We  can  establish  two  additional  interesting  connections 
between  Fuz(H)  ,  and  more  generally  Fuzx(H)  ,  and  Higg(H)  . 

First  choose  any  function 

g  :  Ob ( SET )  -  Rel ( SET , H' )  , 
where  for  any  X  e  Ob(SET)  , 
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g(X)  €  hXxX  3  RelXxX(SET,H‘ }  . 

Let  (X,g(X)),  (Y,g(Y) )  e  Ob{Higg(H})  and  consider  any  f  e 

Ar(X,g(X) ) , (Y,g(Y) ) (Hlgg(H) 5  '  Then  noting  9  £  eg  (see  Appendix  3 

(iv)  at  the  end  of  2.4.2),  (H4)  implies  (F3)  with  right  upper 

bound  eg  .  Also  (H4)  ,  (H5)  imply  (FI)  holds  with  right  upper 

bound  the  same  as  in  Higg(H)  :  D(g(X))  a  D(g(Y) )  .  (F2)  and  (H5) 
are  the  same.  Thus  f  e  Ar (x>D(g(x)  j  j  ,  (YfD(g(y)  j  j  (Fu*  (H)  )  ,  and 

hence 

(a)  Ar(X,g(X) ) , (Y,g(Y) ) (Higg(H) 5 

S  Ar(X,D(g(X)  )  )  ,  (Y,D(g(Y)  )  )  (FuzJH)  >  ' 
for  all  X  e  Ob(SET)  . 

Secondly,  choose  any 

f  :  Ob  (SET)  -*  Rel  (  SET  ,  H*  ) 
where  for  any  X  e  Ob (SET)  , 

f (X)  «  HX  =  Relx ( SET , H’ ) 

and  in  turn  define 

gf  :  Ob ( SET )  -  Rel ( SET , H" ) 

where  for  all  X  €  Ob(SET)  ,  x,y  e  X 

gf(x,y)  -  ef(x,y) 

3  f (X) (x)  a  f (X) (y)  . 

Then  for  all  X  e  Ob(SET) 

fb)  Ar(X,f (X) ) , (Y, f (Y) ) (FuZ«  (H)) 

0  Ar(X,gf (X) ) , (Y,gf (Y) ) (Higg(H) ) 

{Cf (X) ,f (Y) } 

where 

Cf (X) , f (Y) (x,Y)  “  f(X)(x)  a  f ( Y ) ( y )  , 

for  all  x  €  X  ,  y  e  y  . 
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Proof: 

-  Clearly  for  all  X  e  Ob(SET),  (X,gf(X))  e  Ob { y ( SET , H  ))  . 

In  turn,  replace  *  by  *  =  e  and  consider  Fuz_  (H) 

gf  3f  gf 

For  any  f  e  Ar  f  (X)  )  ,  (Y,  f  (V)  )  ' 

gf 

(F3)  all  hold  for  f  with  right  hand  side  upper  bounds  being 
f(X)  a  f ( Y)  ,  for  (FI)  and  eg  for  (F3)  ,  etc. 

Now  form: 


f(x,y)  a  f(x,y*)  s  f (X) (x)  a  f { Y) (y) 


a  f  (X)  (x)  a  f  { Y)  ( y  •  )  a  e  (y ,y)  , 


for  all  x  «  X  ,  y,y *  e  Y  , 

But 

f  (X)  (x)  a  f  (Y)  (y)  a  f  (X)  (x)  a  f(YHy')  a  e_  (y,y) 

9f 

=  f (X) (x)  a  f (Y) (y)  a  f (Y) (y- ) 
s  f ( Y) (y)  a  f (Y) (y‘ ) 

=  gf  (Y)  (y,y  )  . 

Hence  (H4)  holds: 


f  (x.y)  A  f  (X,y  )  s  gf  (Y)  (y,y  ) 


for  all  x  «  X,  y,y  e  Y  . 

In  addition,  since  D(gf)  *  f  ,  (F2)  and  (H5)  are  the  same 

with  this  use. 

All  that  remains  is  (H3)  ,  which  becomes  here,  for  all 
x , x 1  e  x  ,  y,y’  «  Y  , 

gf(X)(x,x’)  a  f ( x , y )  =  f(X)(x)  a  f(X)(x)  a  f(x,y) 

s  f ( x ’ , y)  ? 

f(x,y)  a  gf(Y)(y  ,y)  =  f(x,y)  a  f(Y)(y)  a  f  ( Y>  (y 1  )  s  f(x,y)  ? 


Equivalently, 

f (X) (x* )  a  f(x,y)  s  f(x',y)  ? 

.  f (x, y)  a  f ( Y) (y ' )  s  f ( x , y* )  ? 

Taking  supremums  over  x  in  the  bottom  inequality  and  using 
( F2 )  implies 

f  (X)  (x)  a  f  (Y)  (y  )  s  f  (x,y'  )  . 

But  by  (FI) 

f (x.y* )  S  f (X) (x)  a  f (Y) (y* )  . 

Hence,  we  must  have  for  all  x  «  X  ,  y’  e  y  , 
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Higg(H)  =  u  Higg  (H)  , 
all  g  g 

where  Higgg(H)  is  like  Higg(H)  except  that  Higgg(H)  is 

restricted  to  objects  generated  by  g  and  arrows  are  likewise 
developed.  It  follows,  that  for  any  choice  of  g  there  is  a 
mapping 

Q'  :  Higg(H)  Fuz(H) 
where  for  any  g  as  before 

Q  •  :  Higgg(H)  -«  Fuzg  ,  etc. 

Eytan’s  [57]  claimed  that  Fuzr(H)  and  Higg(H)  are 

equivalent  under  the  mapping  D  ,  but  this  does  not  appear  so.  _ 

(vi)  Stout  [249’]  has  shown  a  natural  relation  directly  between 
Gog(H)  and  Higg(H)  via  the  level  sets  functor  >  .  (See  also 
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Chapters  5  and  6  for  related  results,  where  level  set  mappings  are 
used  to  connect  fuzzy  sets,  random  sets  and/or  flou  sets.): 

H°P 

>  :  Gog(H)  -  SET 

where  for  all  (X,/7)  s  0b(Gog(H))  , 

>(X,/7)  =  /7-1Ch,TH] 

^  {x  I  x  e  X  and  7(x)  ^  h)  , 

and  for  any  h:  ,  h2  e  H  ,  s  h2  ,  and  hence,  ^rrow  (  (hx  »  h2)  : 

ht  -  h2)  of  H  , 

>(X,i7( hlS»h2)  =  M#(XfJ7)(h  j  :  >(X,/7)(h2)  -  XX.^Xhj)  - 

Hop 

and  hence  >(X,/7)  e  SET  .  Also,  for  any 

f  e  Ar  v  /v  ( Gog ( H) )  ,  define  the  natural  transform 

I '  *7  1 '  '  '  ”  '  ^  2 

>(f)  :  >(X1,71)  -1-  >(X2./72)  - 

by,  for  all  h  e  H  , 

>(f)(h)  :  >(X1,i71)(h)  -  >  { X2  ,/7  2  )  ( h)  , 

an  ordinary  function  («  Ar(SET))  ,  where  for  all  k  e  >(Xj,/7^)(h)  , 
>(f)(h)  (k)  3  f  (k)  e  >(X2,/72)  (h)  , 

since 

/?  2  (  f  (k)  )  k  n1{  k)  i  h  , 

etc. 

>  preserves  limits  and  monies  and  the  composition  and 
extension  (see  also  2.4.2  (E))  functor 

Fq  3  Sh°>  :  Gog(H)  -  Higg(H) 

preserves  all  finite  limits  and  colimits  (and  hence  is  "exact")  as 
well  as  all  monies,  but  net  in  general  epis  and  is  hence  not  a 
faithful  functor.  In  addition,  it  has  a  right  adjoint  (see  [2491], 
Prop .  3 ) . 

Next,  let  us  reconsider  the  functor  in  subsection  2.4.2  (F)(2) 

F  :  Gog(H)  -  Fuz(H) 

where  we  add  the  subscript  1  for  notational  purposes  and  we  do  not 
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assume  necessarily  that  H  is  a  complete  chain.  Also,  reconsider 
from  subsection  2.4.2  (F)  (6)  (iv),  the  faithful  functor 

Gq  :  Fuz(H)  -  Higg(H)  , 

with  added  notational  subscript  o  . 

In  summary,  FQ  ,  F^  and  GQ  all  preserve  finite  limits  and 

colimits  and  all  monos,  but  in  general,  FQ  and  F1  are  not 

faithful,  while  G  is.  As  Stout  concludes,  in  any  case,  this 

o 

implies  that  all  three  functors  do  preserve  all  power  class  and 
hence  subobjects,  Gq  being  of  course  injective. 

Extending  Goguen's  observations  [96]  (see  also  2.4.2  (D)  , 

(G4)  )  ,  Stout  points  out  the  basic  idea  that  for  any  category  c 
with  all  limits  and  colimits  (finite)  existing,  for  any 

A ,  B ,  C  e  Ob  ( C  )  and  (f  :  B  -*  A),  (g  :  C  -♦  A)  e  5(A)  ,  the  intersec¬ 
tion  and  union  of  f  and  g  can  be  defined  as  equalizer  (•»„  „  -  ) 

I  ,y 

and  similarly  for  coequalizer  of  f  and  g  . 

Let  (X,/7)  €  Ob  ( Fuz  ( H )  )  =  Ob(Gog(H))  . 

Thus,  for  Gog(H)  ,  and  any  { X 1 ,  /7  2 )  ,  (X2«  /7  2 )  e  9(X,q)  , 

i.e. ,  Xj,  x2  s  x  ,  n  ,  n2  *  n  . 

(Xj ,rj j.)  U  (x2,/72)  =  (x2  u  x2,  nx  v  n2) 

where  q^vq^^q^  or  n2  >  f°r  appropriate  values  not  in 


( Xi  ' 77 1 )  °  {X2,'72)  =  (X1  °  X2 '  ni  A  n2)  ' 

For  Fuz(H)  ,  and  any  (X^,^)  ,  ( X2 ,  /7  0 )  €  ?(X,q)  ,  i.e., 

x1=x2=x,  n x.n 2  ^  q  , 

the  same  formal  definitions  hold. 

Similar  properties  hold  for  Higg(H',  But  in  addition,  as  a 
topos ,  noting  that  for  any  (X,¥)  e  Ob(Higg(H))  , 

<p  :  9  (X,y)  -  fi{X,1P) 

is  bijective,  where  n  =  (H,T„)  is  the  subobject  classifier  for 

Higg(H)  ,  all  set-like  operations  on  subobjects  of  Higg(H)  can  be 
natur- ily  defined  in  terms  of  membership  functions  in 

jjtX.v)  ,  jj  |  f  «  Ar  (Higg(H)))  where  analagous  to  the  situ- 

l  a,  "  j  ,  *1 

ation  for  general  logical  systems  (see  2.3.5),  we  can  define  for  any 
( X , A )  €  9(X,T), A  €  Sub  (X,y)  , 

qs  (  X  ,  A  )  =  <$>  ,  :  X  x  H  H 

A  , 
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where  for  all  x  e  X  ,  h  e  H  , 


*<X.A)(X’h)  = 


1 
l  o 


iff  3  y  e  X  with  h  =  A(x,y) 
iff  otherwise 


In  turn,  we  can  then  consider  basic  arrows  acting  like  &  ,  or  , 

if(  )  ,  nt  (abusing  notation) 

=  A  '  *or  =  v  '  *if(-)  =  =*  6  Arnxn,n(Hi9S{H)> 

and 


♦nt  =  r  e  Arn , n < Higg ( H) )  , 

and  then  compose  these  appropriately.  For  example,  for  any  (X,T  ^  ) , 
(X,r2)  e  ?(X,?)  , 


(X,*^)  u  (X,y2)  i  $~1($ 


( x,'P1 )  A  ♦(X,y2)) 


etc . 

It  follows  that  F  ,  F,  and  G  all  preserve  unions  and 

o  1  o 

intersections  as  well  as  all  natural  orders  relative  to  H  . 

Stout  further  defines  implication  and  negation  intuitionisti- 
cally  (see  again  2.3.3  (C)  (3))  for  Gog(H)  and  Fuz(H)  and  remarks 

that  the  usual  Zadeh  fuzzy  negation  <$>nt  =  1  -  (•)  can  be  obtained 

in  the  compatible  (adjoint)  intuitionistic  form 
♦nt(h)  =  ♦Jh'0)  ' 

with  $  (a,b)  =  $  (l-a,b)  ,  $  =  (min)bndsum  and  <}>  =  maxbndsum. 

3  U  a  Ui  01 

In  addition.  Stout  establishes  a  formal  language  and  a  basic 
semantic  evaluation  for  each  of  Gog(H)  ,  Fuz(H)  ,  and  Higg(H)  , 
analogous  to  the  development  in  2.3.2  and  Appendix  2  for  deduction 
categories  (C  ,»)  where  a  =  Sub  .  It  follows  that  Fq  ,  Fj  and 

Gq  all  preserve  semantic  evaluations.  Finally,  Stout  establishes 

sufficient  conditions  on  Heyting  algebra  maps  f  :  H  -♦  H 1  so  that 
the  naturally  induced  functors  F-  :  Gog(H)  -*  Gog(H’)  , 

Ff  2  :  Fuz(H)  -*  Fuz(H)  ,  and  F^  3  :  Higg(H)  -*  Higg(H’)  preserve 

certain  logical  properties.  For  example  (Theorem  9  [249’]),  if  f 

preserves  a  ,  v  ,  T„  ,  then  F„  ,  a  =  1,2,3  ,  all  preserve 

rl  r  ,a 

universal  quantification  of  implications  of  wff’s  composed  of  com¬ 
pounds  of  atomic  formula,  a  ,  v  ,  3  ,  TH  ,  J-H  ,  i.e.,  geometric 

statements.  F„  can  be  defined  as 

t  t  1 

Ff(1(X,h)  =  (X,  f°q )  , 

for  all  ( X , 7 )  e  Ob(Gog(H))  ,  and  for  any  g  s  Ar(Gog(H))  , 
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Ffl(g)  =  fog  ■ 


G.  Soae  connections  between  set  theory  and  topos  theory 

As  mentioned  previously,  one  of  the  motivations  for  considering 
topoi  is  the  set-? ike  structure  they  have.  That  is,  as  shown  in  2.2 
and  2.3,  the  set  foundations  part  Foun(z)  of  a  class  of  formal 
languages  t  car  be  semantically  modeled  -  as  indeed  all  of  I  - 
within  an  appropriate  topos.  Moreover,  there  is  always  a  canonical 
model  and  canonical  formal  topos,  and  by  the  Benabou  constructions, 
a  topos,  in  which  Foun(t)  can  be  evaluated  compatibly. 

We  present  here  a  summary  of  the  development  in  Johnstone 
[127],  Chapter  9  which  directly  connects  various  set  theory  models 
and  topos  theory  models  -  or  equivalently  topoi,  themselves. 

First,  recall  the  notation  and  results  of  section  2.3.4. 
Define  a  weakened  form  of  Zermelo-Fraenkel  set  theory  (ZF)  as 

Axzer  U)  ~  (Z2>-  <z7><  (ZID-  (212 )  ,  (Z13),  (Z14), 

o 

(Z15),  (Z16) , [ Z17 ) )  , 


which  can  be  shown  to  be  essentially  equivalent  to 
{ (21 )  ,  (Z2),  (Z4),  ( Z77  )  ,  ( Z 1 8  )  }  . 

Define  the  somewhat  stronger 

Ax  U)  ^  Ax  (l)  U  { Z19 ,  Z20)  . 
zer  zer 

o 

Also,  define  wpt  =  "^wpt^^  '  theory  of  well-pointed  topoi 

by 

AxwPt(i)  -  Axtopos  U  {wellpt> 


where  it  should  be  noted  via  the  identification  of  (c,Sub)  with 
c  ,  when  c  is  a  topos,  that  Ax. _  can  then  be  chosen  as  Ax 


when  c  is  a  topos 
(together  with  Ax 


syn 


topos 

for  required  syntax)  in  2.2 


K 

The  axiom  of 


K 


wel 1-pointedness  "wellpt"  is  given  in  [127],  p.  314,  in  concrete 
form,  which  may  be  abstracted  in  a  straightforward  manner. 

In  addition,  consider  the  topos-like  axioms  PT  ,  "partial 
transitivity"  and  TR,  an  axiom  connected  with  transitivity.  (See 
[127],  pp.  303-315.) 

Define  then  the  axiom  systems 


Axwtt(t) 

*  Axwpt(£)  U 

(PT)  , 

Axwtr(i) 

=  AxwptU)  u 

<TR)  , 

Axwttr(t) 

=  Axwpt(I)  U 

{ PT ,  TR ) 

zer 


(M 


Thzer<z> 


In  turn,  we  consider  the  set  theories  Th 
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ThZF(£)  and  the  topos  theories  ^topos^5  '  Thwpt(/'^  '  Thwtt(t)  ' 
Thwtr<*5  •  Thwttr<^  • 

Note  that  if  c  is  a  category  and  n  n  :  t  -*  c  (i.e., 
(c  ,  Sub)  )  is  a  model  for  Th(£)  £  Th.,.  (l)  ,  then  necessarily  c 

zero 

must  be  at  least  a  topos. 

For  a  model  n  u  :  L  -»  (C.Sub)  of  Th(£)  with  c  a  topos,  we 
will  write 

ll  ll  :  Th(£ )  -*  C  . 

Two  topoi  constructed  from  H  H  are  of  importance  here: 

s(«  »)  s  e  , 

where  construction  S  is  given  in  Johnstone  [127],  p.  315,  and 
where 

y(«  it)  =  it  ll  (Var (t )  )  =  Var(iicu)  £  c  . 


Then  abusing  notation  somewhat 


(i) 

If 

ii  ii 

1 

-*  c 

,  then 

y  (ii 

»)  : 

Thwpt^>  - 

y  (ii 

ii  ) 

(ii) 

If 

n  ll 

;  Thzer(£) 

c 

,  then  indeed 

y  ( n 

ll)  : 

Thzero^)  - 

c 

and  hence 

y  (ii 

ll)  : 

ThwPt(i)  - 

y  (ii 

ii  )  , 

in  addition, 

y  (  ii 

ll)  : 

Thwttr<£>  - 

y  (ii 

ll)  . 

(iii)  Conversely  to  ( i )  and  (ii): 
If  ll  ii  :  Thwp1;(t)  -*  c  ,  then 

S  ( II  II  )  :  ThzerU)  -  S(ll  ll  ) 


and  in  turn  from  (ii) 

y(s(n  ii))  :  Thwttr(A)  -  y ( S ( H  ll)) 

and  if  also  ii  ii  :  Thwtr(£)  -•  c  ,  then  it  can  be  shown  that 

y(S(n  H))  is  equivalent  to  the  subcategory  of  all  partially 
transitive  objects  of  C  (see  [127],  p.  311). 

(iv)  An  improvement  to  (ii)  ,  using  (iii): 

If  ii  ii  :  Th  (l)  -  c  ,  then  from  (ii) 
zer 


y  (  ii 

II)  : 

Thwtt 

r^ 

1  -  y( 

and  hence 

y  (ll 

ii)  : 

Thwpt 

U) 

y  (ii 

which  in  turn 

from 

(III) 

implies 
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sy  (II  II  )  )  :  Th  ze |£)  -♦  S(y  (II  II  )  )  , 

but  in  addition,  it  can  be  shown  that  S(y(li  11 ) )  is  isomorphic  to 
ii  ii 


(v)  Note  that  if 

II  II  =  ThwttrU)  -  C  , 

H  ii  :  Thwpt(i)  -♦  C  ,  and  (iii)  applies, 


(vi)  Thus,  in  the  above  cyclical  construction  sense: 
and  Thwttr(x.)  are  "logically"  equivalent. 


Th  (l ) 
zer 


(vii)  In  a  similar  vein,  strengthening  Th2er(i)  further  to 
Th2FU)  and  Th  uttr^L)  sufficiently  to,  say,  Thwg{z)  ,  then 
Th2F(i)  and  Thw^(£)  are  "logically"  equivalent.  (See  [127],  p. 
318.) 

(viii)  On  the  other  hand  (as  Johnstone  also  remarks),  replac¬ 
ing  Th„,(£)  and,  in  effect,  weakening)  by  Thv(i)  ,  as  given  in 

Zcl  fv 

section  2.2.2,  yields  the  result  that  (omitting  the  necessary  syntax 
theory  and  choosing  for  the  range  of  semantic  evaluations  (C,Sub), 
c  a  topos  -  see  2.3.2)  Thx(/.)  and  ThTopos(£)  are  "logically" 

equivalent . 

(ix)  Note  that  Higg(H)  is  a  well-pointed  topos.  (Apply 
Freyd's  theorem  to:  any  terminal  object  in  Higg(H) )  ,  ( (a } , T^ )  , 

(a)  any  singleton  set,  is  a  generator  -  see  Johnstone  [127],  p.  314 
and  Appendix  1,  9  (xii);  see  also  a  similar  property  for  the  non- 
topos  Gog(H)  in  subsection  ( D ) . )  Hence,  by  result  (iii),  a  model 
(using  the  S  construction)  can  be  constructed  from  Higg(H)  for 


(x)  For  applications  of  topos  theory  to  the  investigation  of 
consistency,  independence,  and  other  set  foundations  problems,  again 
see  [127].  See  also  Mitchell’s  interesting  exposition  [179]. 


> 
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Appendix  1 

Basic  category  and  topos  definitions 

1.  For  any  category  C  =  (Ob(C),  Ar(c))  ; 

Ar(C)  =  u  Ar  (e)  =  class  of  all  arrows  for 

U  <  r 

a  ,/3e0b(C  ) 

Ara  p(c)  =  class  of  all  arrows  from  a  to  p 
a,p  c  Ob(C )  , 

Ob(C)  «  class  of  all  objects  y 
S ,  B  :  Ar  -♦  Ob  # 

where  for  any  f  e  Ar  <c)  ,  a,p  «  Ob(c)  , 

Ct  ,p 

S(f)  i  a  (source  or  domain  of  f)  , 

B ( f )  ^  p  (end  or  codomain  of  f)  , 

D(c)  £  { ( f ,g)  |  f ,g  «  Ar (c )  &  S(f)  «  B(g)) 

£  Ar (c )  x  Ar(C)  , 

E(c)  £  { { f , g)  1  f,g  «  Ar (c )  &  s(f)  =  S(g)), 

£  Ar(C)  x  Ar(C)  , 

Id  :  ob (c )  -»  Ar  (C  1  ,  identity  map, 

°  :  D(C)  -«  Ar(c)  ,  composition, 
where  for  all  (f,g),  (g,h)  e  D(C)  ,  a  e  Ob(c)  , 

Id(a)  «  Araa(C)  . 

Sold  =  IdoB  =  id  , 

S(fog)  =  S(g)  ,  B { fog)  =  B(f)> 

(  fog)oh  =  fo  (goh)  , 

2.  In  addition,  category  c  may  have 

x  :  Ob (c )  x  Ob(c )  -  Ob(C ) 
a  finite  cartesian  product  operator 
<,>  :  E(C)  -♦  Ar(C)  , 

ordered  pair  operator, 

pro j  j  :  Ob(C  )  x  Ob(C  )  -  Ar(C) 

proj2  :  Ob(c)  x  Ob(c)  -*  Ar(C)  ,  projections, 

satisfying,  for  all  f,g  «  Ar(C)  ,  a,p  e  Ob(C)  , 

S( <f ,g> )  =  S ( f )  =  S ( g )  , 

B ( <f , g> )  =  B ( f )  x  B ( g )  , 


c  ; 

for  c 
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S  { pro  j  ^[a  ,p)  )  =  S(proj2(a,/J)  )  =  a  x  p  , 

Btprojjfa./J)  )  =  a  , 

B(proj  )  j  (a  ,/J )  =  p  , 
proj1(B{f ) ,B(g) )  o  <f,g>  =  f 
proj2(B(f ) ,B(g) )  °  <f,g>  =  g  . 

If  Bf  -  a  x  p  ,  then 

<pro j 1(a,p)of  ,  pro j2 (a ,p)of>  =  f  . 

3.  Category  SET  . 

SET  =  (Ob (SET) ,  Ar ( SET ) )  , 

Ob(SET)  =  class  of  all  ordinary  sets 

Ar ( SET )  =  U  Ar { SET)  . 

X, Y  X, Y 
€ Ob (SET) 

Arx  y(  SET)  =yX=(f|f:X-*Y  in  ordinary  sense) 

for  all  X, Y  e  SET 
Id  =  id  , 

i.e.,  for  all  X  €  Ob(SET)  , 

Id(X)  *  id(X)  :  X  -  X 
where  for  all  x  «  X  , 

id(X) (x)  -  x  . 

°  :  D  ( SET )  Ar  (  SET ) 
is  such  that  for  all  (f,g)  «  D(SET)  , 

fog  =  f  composed  with  g  in  ordinary  sense. 

Clearly  SET  has  x  ,  ordinary  cartesian  product,  and  proj1  , 
proJ2  ,  ordinary  projections. 

4 .  Categories  Preord^  . 

Let  s  :  Ob(SET)  -♦  Ob(SET)  be  a  fixed  mapping  such  that  for 
any  X  «  Ob(SET)  ,  s  X  x  X  is  a  fixed  preorder  on  X  .  Then 

Preord^  *=  ( 0b{ Preord^ ) ,  Ar(Preord^)) 


is  such  that 
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Ob(Preords 


{(x,sx)  |  X  «  Ob { SET ) } 


Ar { Preord^ ) 


x“y  Ar(X,sx) , (Y,Sy) 
«0b( SET) 


(Preord^ ) 


Ar(X,sx) , (Y,sy)Y(PreordS ) 


{£  |  f  :  X  -♦  Y  and  for  all  xltx2  €  x  > 

if  Xj  sx  x2  ,  then  f(xx)  sy  f(x2)} 
class  of  all  preorder-preserving 
functions  S  Y*  5 


o  is  usual  functional  composition. 


Id  -  id  . 


5 .  Deduction  categories. 

(c,a)  is  a  deduction  category  iff  c  is  a  category  with  a 
finite  product  (x)  operator  and  a  :  c  -*  Preord^  ,  for  some  fixed 

choice  of  S  ,  is  a  contravariant  functor,  i.e.,  for  any 
a,fi  ,i  «  Ob(c )  ,  and  any  f  e  Ar^  7  (e  )  ,  g  €  Ara  ^(c)  , 

a  (a),  a(/J),  a  {y)  e  Ob(Preords)  , 
a(f)  «  Ara(7)>a(/s)(Pre0rd5)  . 

»(g)  «  Ara(;Q)iSl{a){Preord5)  , 

and  noting  fog  e  Ar  1C)  , 

a  $  7 

a(fog)  =  a{g)oa(f)  e  Ara (7 }  % (a } (Preord^ )  . 

It  follows  from  part  (4)  that  for  all  x^,x2  «  a(V)  such  that 

X1  sa(ir)x2  '  then 

ajfjfxj)  :Sa(p)a(  f )  (*2)  • 

6.  Let  c  be  any  category  with  not  only  a  finite  product  operator, 
but  with  all  finite  limits  existing.  Then  (e,Sub)  is  a  deduction 
category,  where 


Sub  :  c  -♦  Preord 

Sub 

is  the  subobject  functor  and  *sub  is  chosen  as  follows  : 

Let  a  e  Ob(C)  be  arbitrary.  Let  f,g  «  Ar(c)  also  be  any 
two  monies,  i.e.,  for  any  fi'f2'al'92  e  Ar(c)  with 


B ( f  j )  =  B( f 2 )  =  S(f)  ,  S ( f x )  =  S(f2) 

Blgj)  =  B(g2)  =  S(g)  ,  S(ga)  =  S(g2)  ,  if 


fof 


1 


f 


2 


fof2  ,  then  fj 
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and  If 


go9i  =  gog2  ,  then  gj  =  g2 


Suppose  also  B(f)  =  B(g)  =  a  .  Define  f  sSut,(a)a  '  J,iJ’ 

f  =  g°h  ,  for  some  (monic)  h  e  Ar(c)  (B(h)  =  S(g),  S(h)  =  S  ( f ) )  , 

and  define 


,g  .  iff 


f  =Sub (a) 9 


f  SSub(a)g  and  9  SSub(a)f  * 

Then  it  follows  that  for  each  a  e  Ob(C)  ,  “Sub.a  generates  an 

equivalence  relation  among  the  class  of  all  monies  with  codomain  a  , 
partially  ordered  by  Ssub(a)  ’  Ca*i  this  class  the  class  of  sub¬ 
objects  of  a  ,  denoted  (Sub(a)  ,  ^ sub ( ot )  >  '  and  the  resuiting 

order  is  *Sub  . 

For  any  a,p  e  Ob(c)  ,  let  f  «  ArQ  ^(e)  be  arbitrary  and 
(0  :  y  -*  P)  «  Sub(/J)  arbitrary  (•»€  Ob(C))  .  Then  obtain  pullback 
(  =  t  x  a ,  where,  in  the  following  diagram  all  maps  commute  under 

g.  f 

compositions  (well-defined)  «  Ob(C)^  projections  pro j ^ ^  e  Ar  (c) 

y  C  r® 

are  all  unique  such  that  for  any  q  «  Qb(c)  with  s  €  Ar  (c)  , 

t  c  Ar  (c)  such  that  gos  =  f°t  ,  there  is  a  unique  r  e  Ar  .(C) 
n  ,a  0  1  K 

making  all  maps  commute: 


Then  define  the  monic 


Sub ( f ) ( g )  =  pro j 


(^*«) 


with 


identifying  the  equivalence  class  under  =sUb(a)  with 
(y*a) 

proja  H  «  Sub (a )  , 

Also  note  that  if  (h  :  a  -♦  p)  e  Sub(/J)  is  also  aroitrary  such 
that  g  sSub(£)h  .  then  g  *  h»k  ,  for  some  (k  :  t  -*  a)  and 

similarly. 


Sub( f ) (h)  »  proj 


(«*a> 


,  relative  to 


"Sub(a) 
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Since 


(*g«0  («**)  (fix-f) 

ProJa  H  =  proja  p  o  (icxid^) , 

fi 

it  follows  that 

Sub (f)(o)  ssub(a)Sub(f > {h)  ' 

and  hence 

Sub(f)  «  ArSub{/J)  Sub(a){PreordSsub)  , 

and  hence  Sub  :  C  -♦  Preord  is  a  contravariant  functor  and  thus 

sSub 

(C , Sub)  is  a  deduction  category. 

7 .  The  deduction  category  (SET. Sub)  . 

First  note  the  following  construction  and  commutative  diagram 
for  any  (non-vacuous )  X,Y  «  Ob  (SET)  and  any  f  €  Ar^  y(SET)  , 

i.e.,  f  *  Y*  in  the  ordinary  sense: 

Define 

rng(f)  =  (f(x)  |  x  «  X)  £  Y 

with 

f  „  :  X  -»  rng(f)  ,  (surjective) 
rng 

Sub(Y)  (rng( f )  )  :  rng(f)  -♦  Y  ,  (identity) 
given  by,  for  all  x  «  X  and  all  y  «  rng(f)  , 

'mg'*'  3 

Sub ( Y ) ( rng ( f ) ) ( y )  =  idy(y)  =  y  . 

Noting  that  F  ^  ^  (f_1(x)  |  x  «  rng(f))  forms  a  disjoint 
exhaustive  partitioning  of  X  ,  let  gf  :  Ff  -»  X  be  an  arbitrary 
choice  function  such  that  for  each  x  «  rng(f)  , 

gf(f_1(x))  e  f-1(x)  . 

Then  define 

Xf  =  rng(gf) 


and  define 
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sur 

Sub ( X ) (Xf ) 

finj 

fbij 

where,  for  all 


:  X  -*  Xf 
:  Xf  -*  X 

:  Xf  -  Y 

:  Xf  rng(f)  , 

x  «  X  and  all 


(surjective) 
{ identity) 

( injective) 
(bi jective) 

z  «  Xf  , 


fsur(x)  “  Sf(f_1(x)) 

Sub(X)(Xf)(z)  3  idx(z)  =  z 

finj(z)  ~  fbij(z)  =  idY<z>  -  z  * 


Thus , 


rng(f ) 


(*) 


Hence,  in  particular, 

f  =  Sub(Y)  (rng(f ) )  o  f 
Sub(Y) (rng( f ) )  =  f  o  Sub(X)(Xf)  °  f“Jj 


Thus  (without  even  assuming  f  is  a  monic)  indeed, 
=Sub(Y)  "  e£Iuivalent  to  the  monic  Sub(Y)  (rng(f ) )  . 

Furthermore,  if  Z  «  Ob(SET)  and  h:X-*Z,g:Z-*^ 
arbitrary  such  that 

( **  )  f  =  g°h  , 

Since  analogous  to  the  above  development 

g  =  Sub(Y) (rng(g) )  o  g. 


'rng  ' 


eqs .  ( * ) ,  ( *  * )  imply 


Sub(Y)  ( rng(  f ) )  **  Sub(Y)  (rng(g) )  o  k 
SSub(Y)Sub(Y){rng(g)) 


9rng  *  h  '  s«b<X)  (X,)  . 


f  is 

’  are 


where 
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Hence,  for  (SET, Sub)  ,  the  preorder  reation  for  Preord  must 
be  25  ■  s  ,  i.e.,  ordinary  subset  inclusion,  and  for  all 
X  «  Ob (SET)  ,  we  may  take  equivalently 

Sub(X)  =  (*(X) ,S*(X) )  • 

In  addition,  in  SET  for  any  X,Y,Z  €  Ob(SET)  with 
f  :  X  •»  Y  ,  g:Z-*Y,  the  pullback 

t  =  Z  x  X  *»  { { z ,  x )  |  z«  Z  ,  xe  x  ,  and  f  ( x )  =  g  ( z )  } 

Q.t 

=  g~ 1 ( rng ( f ) n r ng ( g ) )  x  f-1(rng(f)  n  rng(g))  £  Z  x  X  , 

and  proJx?  ^  ,  proj^  '  become  ordinary  X-  ,  Z  -  projection, 
respectively  over  {  . 

Hence,  for  any  X,Y  «  Ob  (SET)  and  any  W  €  Sub(Y)  ,  i.e.,  we 
can  let  W  «  *(Y)  ,  and  identify  further  W  with  the  identity 
imbedding 

gQ  =  Sub ( Y )  (W)  :  W  -*  Y  , 


Sub( f ) ( W)  =  Sub(f ) (Sub(Y) (W) ) 

-  proj£{ *  ,  relative  to  £y(x)  > 

where  now 

e  =  w  x  x 

0o.f 

»  ( ( w , x )  |  w  «  w  ,  x  e  X  and  f(x)  *  w) 

which  may  be  identified  with  the  inverse  mapping  f-1  :  W  -*  »(X)  . 
Hence,  we  can  let 


Sub( f ) ( W)  =  f  1 ( W)  , 


and  check  the  contravariance : 

For  any  VS>(X)  w  <  i.e., 

V  £  W  £  X  ,  V,W,X«  Ob ( SET )  , 

Sub(  f  )  ( V)  «=  f_1(V)  £  Sub  (  f  )  ( W)  =  f-1(W)  , 

etc . 

In  summary,  (SET, Sub)  is  equivalent  to  Sub:  SET  -f  Preord£ 

where 


0b( Preordg 
Ar ( Preord^ 


{  (»(X) ,£x)  |  X  e  Ob ( SET ) )  , 

rx,YU 

l«Ob(SET) J 


(Preord£ ) 
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Ar(»(X)  'C»(X)} ,(*(Y)  'S{Y>(Pre°rds> 

=  {f-1  |  f-1  :  *{Y)  -♦  *(X)  ,  where  f  :  X  •*  Y  is  arbitrary)  , 
under  the  usual  relation  composition 

(f°g)-1  =  g~1o{~1  ,  etc., 

and  for  all  X,Y  €  Ob(SET)  ,  f  ;  X  -*  Y  , 

Sub(X)  -  (»(X)-S(X))  ' 

Sub  ( f )  -  f-1  :  9(Y )  ->  *(X)  . 

8 .  The  deduction  category  (SKT.H*  . 

Let  H  «  Ob(Preord)  to  be  specified  later,  with  partial  order 
S  .  Define  contravariant  functor  h’  :  SET  -*  Preord  by: 

For  all  X  «  Ob(SET)  (i.e.,  for  any  ordinary  set  X  )  , 

H'  (X)  3  hX  *  {f  |  f  :  X  H  in  the  usual  sense)  . 

Define  the  partial  order  s  on  HX  by  if  f,g  «  HX  , 

HX 

f  s  x  g  iff  f(x)  g(x)  ,  for  all  x  e  X  . 

H 

For  all  X , Y  «  Ob(SET)  and  any  h  «  Ar^  y ( SET)  {i.e.,  for  any 
h  «  YX  or  h  :  X  -*  Y  ,  in  the  usual  sense)  ,  define 

H‘  (h)  :  HY  -  HX  , 

y 

by,  for  any  f  «  H  , 

H  (h)(f)  =  f°h  (ordinary  composition)  and  note  that  if 

f ,g  «  HY  with  f  S  g,  then  H' (f ) ,  H' (g)  *  HX  with 

H 

H  (f)  s  H  (g)  ,  since  f(h(x))  s„g(h(x))  for  all  x  «  X  . 

H  H 

Now  depending  on  the  choice  of  H  ,  (SET,H)  may  have  addi¬ 
tional  properties. 

Note  first  that  here 


Rel ( SET , H  )  =  U 


where 


XeOb(C) 


Relx(SET,H  )  , 


Relx(SET,H  )  =  H  , 

and  hence  substitution  here  becomes,  for  any  r  « 


noting 
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o(rv)  =  X  ,  and  r  :  X  •*  H  Is  an  ordinary  function,  and  for  any 

A 

h  :  Y  -*  X  ,  Y  «  Ob ( SET)  , 


r[h]  =  H' (h) (r)  =  r«h  , 

i.e.,  for  all  axioms  and  rules  involving  the  substitution  operation 
•[••]  ,  the  latter  becomes  simply  ordinary  composition  (in  the  same 
order )  . 

Thus,  e.g.,  if  H  has  structure  so  that  H  has  a  largest 
element,  say  T^  «  H  and  a  smallest  element  e  H  ,  with  respect 

to  SH  .  then  conditions  (T)  <R1|Synfl8  .  *IfThf3  •  Ri,syn,19> 

(i)  (RI,syn,22  *  RI,Th,7  '  RI.syn.23l  are  Mediately  satisfied, 
by  using  the  mappings 


T ,1  :  Ob (SET)  -  Rel ( SET , H’ )  as 


T(X) (x)  3  th  , 

l  (X)  (x)  3  i  ,  for  all  x  «  X  ,  for  all  X  «  Ob(SET)  . 


Next,  we  consider  some  structure  on  H 


\H  ,  to  correspond  to 


a  ,  although  not  necessarily  the  "and"  axioms  and  rules  given  in 
Loc(c ,a)  before  (RI#syn20.  Ri,Syn,21'  Rl,Th,4'  RI,Th,5'  RI)Th,6)  * 
For  example,  if  we  choose  H  to  have  an  operation  correspond¬ 
ing  to  a  =  min,  then  we  require  ah  ov^r  H  to  satisfy  formally 

rt  tv,  a  k  c  ,  i.e.,  for  all  elements  r,s,t  «  H  , 

I »  THi 4 , 5  f  o , 


if  r  s  ,  then  t  Afi  r  ,  r  ah  t  s  ,' 

if  r  s,t  ,  then  r  s  ah  t  . 


On  the  other  hand,  if  we  choose  H  to  have  the  more  general 
structure  of  a  t-norm  &H  ,  replacing  ah  (see  the  remarks  in  2.2.2 

VI  and  section  2.3.6)  ,  then  this  structure  will  be  reflected  in 

the  mapping  &  :  Rel ( SET , H  )n  -♦  Rel(SET,H  )  ,  where  for  example  for 
n  2  ,  and  any  X  «  Ob(SET)  ,  r , s  :  X  -*  H  relations  in 

Relx(SET,H  )  ,  &(r,s)  =  r&  s:X-*H  so  that  for  all  x  «  X  , 


(r  &  s) (x)  =  r(x)&Hs(x) 

with  the  t-norm  properties  of  symmetry,  associativity  (so  that 
unambiguously  &H  can  be  recursively  extended  to  any  number  of 

arguments  -  see  2.3.6),  nondecreasing,  bounded  above  by  min,  and 
having  the  marginal  properties,  with  possibly  additional  properties 
such  as  ldempotence,  etc. 

Similar  remarks  hold  for  or  and  a  t-conorm  structure,  as 

n 

well  as  for  other  operators  such  as  negation,  and  universal  and 
existential  quantification,  as  well  as  for  equivalence  and  set 
membership.  In  particular,  see  section  2.3.3  (C)  ,  GOdel-Intuition- 
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istlc  Logic,  for  a  brief  summary  of  Heyting  algebras  (choosing  in 
effect  H  to  be  a  Heyting  algebra)  and  intuitionistic  logic 
operators  over  them.  See  also  Remark  3  at  the  end  of  Appendix  3. 

9 .  Some  Miscellaneous  basic  category  concepts. 

We  will  exploit  commutative  diagrams  for  all  definitions: 

Let  c  =  (Ob(C ) ,Ar(c ) )  be  any  category. 

(i)  If 


Ida  Idfl  ' 

i.e.,  a,/J  «  Ob(C)  ,  f  e  Ara,p{C^  *  9  e  Ar/j,a<c>  ' 
with 


g- f  =  IdQ  ;  fog  =  Id^  , 


then  we  say  f  and  g  are  Inverse s  of  each  other  (with  either  one 
uniquely  determined  by  the  other)  and  a  and  p  are  Isomorphic 
objects . 


(il)  If.  for  some  a,p  €  Ob(C)  ,  f  «  ArQ  p(c)  ■  For  all 
■t  «  Ob(c )  ,  all  f\'{2  *  Ar1fa(r')  » 


Implies  fj  *  f2  , 


then  we  say  f  «  Ara  ^(C) 


Is  a  monte  (or  raonomorphlsm)  . 


(Ill)  If,  for  some  a,p  «  Ob  C)  ,  f  «  Ar  -(C)  :  For  all 

(X  t  fj 

1  €  Ob (c )  ,  all  t1't2  €  Ar^(7<c) 


a 


f 


P 


7 


Implies  f1  =  f2 


then  we  say  f  e  Ar^  ^{C)  is  an  epl 


( morphism) 


(lv)  Let  a  e  Ob ( c )  .  If  for  all  p  -  Ob(C)  ,  there  is  a  unique 

f  «  Ar  (C)  ,  called  a  terminal  arrow  then  a  is  called  a 

p  tCX.  p  ,  O. 

terminal  object  ,  In  whl ’h  case,  necessarily.  f  =  Id  and  any 

Q  ,  Q  CL 

two  terminal  objects  are  Isomorph' o  ^ften  we  write  1  for  the 
terminal  arrow. 
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(v)  Let  a  «  Ob(C)  .  If  for  all  p  e  Ob(C)  ,  there  Is  a  unique 
ga  e  Ara  ^(C)  »  called  an  initial  arrow,  then  ;  a  is  called  an 

initial  object  ,  In  which  case,  necessarily,  ga  a  =  Ida  ,  and  any 

two  initial  objects  are  isomorphic.  We  denote  a  «  0  . 

Note  that  for  SET  ,  monos  are  the  same  as  ordinary  injective 
functions,  epis  are  the  same  as  ordinary  surjective  functions,  iso¬ 
morphisms  are  the  same  as  bijective  functions  -  and  hence,  in  this 
sense  all  sets  having  the  same  cardinality  are  isomorphic;  any 
singleton  set  is  a  terminal  object,  the  null  set  is  an  initial 
object  (by  definition) . 

(vi)  Subobjects  have  been  discussed,  the  definition  being  moti¬ 
vated  by  monos.  Dually,  quotient  objects  may  be  defined,  relative 
to  all  epis  (for  a  fixed  category)  where  ordering  squo  epis  f,g 

with  S ( f )  =  S(g)  ,  is 


f  5 Quo  9  iff  f  “  h°9  ' 

and  where  =quo  9  is  defined  by  the  usual  relation 
f  “Quo  g  iff  (f  Vo  g)  and  {g  SQuo  f)  ' 

etc . 

For  SET  ,  it  follows  that  for  any  X  e  Ob(SET)  ,  Quo(X)  ,  the 
class  of  all  quotient  objects  over  X  ,  can  be  identified  with 
(Y  |  Y  €  Ob ( SET )  and  X  £  Y)  . 


(vli)(a)  Finite  products  are  defined  analogous  to 
the  right  half  of  the  commutative  diagram  removed  - 


that 


is  removed,  etc. 


pullbacks  with 
as  in  ( 6 )  ,  so 


Thus,  for  example,  for  two  objects:  Let  a,p  e  Ob(C)  .  Then 
the  product  a  x  p  «  Ob(C  )  ,  if  it  exists,  where  a  and  p  are 
called  its  factors  and  arrows  proj  ,  proj  (see  below)  called  the 

projections,  are  such  that  for  each  1  e  Ob(c )  and  each 
f  «  Ar7  a  ,  g  c  Ar7  ^(c)  ,  there  is  a  unique  kfg  e  Ar7^ax  (c) 

such  that  the  following  diagram  commutes: 


P  • 


(b)  Finite  coproducts:  For  example,  for  two  objects:  Let 

a  ,p  €  Ob(C)  .  Then  the  co-product  a]J./3(or  a  t  p )  e  C  ,  if  it 
exists,  where  a  and  p  are  called  its  summands  and  arrows  inj  , 

Cl 

inj^  (see  below)  called  Injections,  are  such  that,  for  each 
1  «  Qb(C)  ,  and  each  f  €  Ar  ,(C)  ,  there  is  a  unique 

GC  t  j 

hf  g  e  Ara  |  |  y  (C  )  ,  g  e  Ar^  7  ( c  )  such  that  the  following  diagram 

commutes  : 
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Note,  for  SET  ,  all  products  coincide  with  the  usual  cartesian 
products  of  sets  and  all  co-products  with  the  usual  cartesian  sums 
of  sets. 

(viii)  If,  for  some  a,p  e  ob{c  )  ,  f,g  «  Ara  ^(c)  :  For  all 

q  e  Ob(C )  ,  all  k  e  Ar  ( C )  ,  all  l  «  Ar  (c) 

•7  #  ”  n  i a 


then  we  say  y  a  p  f  g  is  an  equalizer  for  a.  ,  p  ,  f  ,  g  with 

unique  arrow  h  f  «  Ar  (c )  . 

«  »/5 .  f  »g 

For  SET  ,  and  any  X  ,  Y  «  Ob  (SET)  with  f ,  g  :  X  -*  Y  , 

1 X , Y , f , g  =  <x  I  x  €  x  and  f(x)  =  9<x)>  • 

(ix)  Suppose  category  c  has  all  finite  products  existing. 

For  some  a,p  €  Ob(C)  :  If  for  each  y  «  Ob(c)  and  each 
f  «  Ar  „(C)  >  there  is  a  unique  h  «=  h  e  Ar  ( C )  such 

that 

a  x  ■»  - 

id  x  h  I 
a 

a  x  /3a 

then  we  say,  for  that  a,/J  e  Ob{C)  ,  exponential  pa  e  Ob(C)  exists 

with  evaluation  ev  «  Ar  „a  „{C)  . 

ax/3  ,/j'  ' 


For  SET  ,  for  all  X,Y  e  Ob(SEt)  , 


YX  =  (f  |  f  :  X  -  Y)  and  ev  :  X  x  YX  -»  Y  is  such  that  for 
all  x  €  x  ,  f  e  YX  ,  ev ( x , f )  =  f(x)  . 


(x)  If  there  is  some  terminal  object  U  e  Ob(c ) 
n  «  Ob(c)  and  gQ  e  Ary  fi(c)  such  that  for  any 

representative  subobject  (f  :  a  -  p)  of  p  , 
hf  *  Ar^  n  such  that  subobject  (project  )  -*  a) 

are  ^sub(£)  equivalent,  where  {  3  Ug  xh  p  : 


and  an  object 
p  «  Ob ( C )  and  any 

there  is  a  unique 
and  (f  :  a  -*  p) 
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pro  j 


pro  j 


9r 


Then  gQ  is  called  the  true-mapping,  hf  the  cl  asst  fying  or 
character  i  at  I  c  map  ,  n  the  subobject  classifier,  and  the  power 

object  is  s 2 P  (from  part  (ix)). 

For  SET  ,  we  may  choose  for  any  X  €  Ob  (SET)  ,  W  e  Sub(X)  , 
i.e. ,  W  £  X  , 


{ 1  }xW 


Sub ( Y ) ( W) 


That  is,  the  subobject  classifier  is  n  =  (0,1)  ,  the 

true-mapping  gQ  »  id  :  (1)  -♦  (0,1)  ,  id(l)  =  1  ,  and  the 

classifying  map  here  is  hf  =  <|>M  :  Y  -»  (0,1)  ,  where  <|>w  is  the 

ordinary  membership  (or  characteristic  function)  map. 


(xi)  More  on  subobjects  of  a  category  c  . 


(a)  C  has  assoc  I  at  t  ve  Images  iff,  by  definition, 
a , p  ,t  €  0b(C )  ,  and  all  f  e  Ar  -(C)  ,  g  e  Ar  (C) 

Cl  t  p  p  t  » 

(unique)  smallest  (in  sense  of  Ssub(/J)'  SSnb(i)) 
written  f(a)  of  p  and  g (p)  of  t  such  that 

f  5Sub(/J)f(a)  '  g  SSub(^  )3(/5) 


for  all 
,  there  are 

subobjects , 


and  such  that 


g(f (a)  =  (g°  f ) (a)  • 

(b)  C  is  said  to  be  complete  distributive  lattice  (c.d.l.)  - 
ordered,  iff,  by  definition,  for  all  a  *  C  ,  Sub(a)  under  sSub 

is  a  complete  distributive  lattice  (c.d.l.) 


(c)  If  c  is  a  c.d.l.,  then  for  c  all  classical  unions  and 
intersections  of  subobjects  exist,  where  we  define  for  any 
a  6  Ob (C)  and  any  p ,i  «  Sub(a)  ,  p  U  i  ,  p  n  t  e  Sub(a)  ,  as  the 
least  upper  bound,  greatest  lower  bound  for  p  and  i  relative  to 

S  Sub  (a )  '  resPec'tiveiY'  In  turn,  the  pseudo-complement  p‘  of 

P  «  Sub (a )  is  the  least  upper  bound  for  the  class  of  all 
q  «  Sub(a)  with  q  0  p  =  ,  where  ma  is  the  minimal  element  in 

Sub  ( a )  relative  to  s$ub(a)  *  We  say  £  > 1  *  Sub(a)  are  disjoint 

iff  p  n  i  «  ma  ,  etc. 
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(xii)  An  object  a  «  Ob(C)  is  called  aortic  iff  for  all  f  €  Ar(C) 
with  S ( f )  *  c  ,  f  is  monic. 

An  object  a  «  Ob(c)  is  called  atoalc-aonlc ,  iff:  (i)  a  is 
monic;  (ii)  for  each  i  e  Sub(a)  ,  i  is  atoaic ,  i.e.,  (assuming 
Sub  (a)  isac.d.l.)  if  ir  *  ma  (the  minimal  element  in  Sub  fa) 

w.r.t.  £  sub ( a )  and  **  *7  *  Sub(a)  is  such  that 

ma  ^Subja)'?  *Sub (a)1  '  then  n  "  “a  °r  ">  “  7  :  <A11>  for  an* 

P  «  Ob(C )  ,  Ar  .  #  0  . 

a. ,  p 

An  object  at  €  Ob(c)  is  called  a  generator  (or  separator)  of 
c  ,  iff,  for  any  p  ,t  e  Ob(C)  and  any  f,g  «  Ar^  7(C)  with 

t  *  g  ,  there  is  some  h.  «  Arfl  (C)  such  that  f°hf  *  g°h. 

More  generally,  a  set  {  a^,...,a  )  £  Ob(C )  is  called  a  generating 


(or  separating)  set  of  objects  for  c  ,  iff,  for  all  p.i  e  ob(C) 

and  f,g  e  Ar„  (c )  with  f  i*  g  ,  there  is  some  j  ,  1  <  j  £  h  , 

P  >  ~ 

and  h,  «  Ar  (e)  such  that  f«h.  *  g°h.  . 

J  aj'A*  J  J 

An  object  a  €  Ob(C)  is  called  projective  iff,  for  each 

f  e  Ar (c )  and  each  epi  g  e  Ar(C)  with  a  =  S(f)  and  B(f)  =  B(g) 


there  is  some 


hf,g  €  Ar(C) 


such  that 


(and  hence 


S(hf  )  =  S ( f )  ,  B(h,  ) 
A  •  y  *  »u 


s(g) ) 


For  SET  : 

For  any  X,Y  «  Ob(SET)  ,  f  «  Arx  y(SET)  =  Y*  ,  the  image 

f(X)  =  Sub ( Y) ( rng ( f ) )  with  f  sSu^(y)^(x)  and  clearly  SET 

associative  images  and  is  a  c.d.l.  under  5_  .  ■  c  with  unions 

aUD 


has 

and 


intersections  of  any  subobjects  W  e  Sub(X)  =  ?{X)  being  ordinary 
unions  and  intersections  of  Ws  ;  mx  =  0  ;  pseudo-complement  W 


of  W  «  Sub(X)  is  the  ordinary  complement  CW  =  X  -I  W  , 


The  monic  objects  of  SET  consist  of  all  singleton  sets  (x)  , 
x  «  X  ,  X  «  Ob(SET),  i.e.,  the  monic  objects  and  terminal  object(s) 
coincide.  The  atomic  objects  of  SET  are  also  the  singleton  sets, 
while  any  nonvacuous  X  €  Ob (SET)  is  a  projective  generator,  since, 
for  any  Y,Z  «  Ob(SET)  and  any  f  :  X  -»  Y  and  g  :  Z  -»  Y  surjec¬ 
tive,  we  can  define  hf  :  X  -♦  Z  by  choosing  for  each  x  e  X  , 


h,  (x)  arbitrary  fixed  «  g-1(f(x)) 
*  » y 


For  SET  ,  the  atomic-monic 


projective  generators  are  the  same  as  the  singleton  sets. 


(xili)  A  top os  is  a  category  for  which 

(a)  All  finite  products  x  exist, 

(b)  Has  a  terminal  object, 

(c)  Has  equalizers  for  all  pairs  of  objects  in  it, 

(d)  Has  exponents  and  evaluation  maps  existing  for  all  pairs 

of  objects  in  it, 

(e)  Has  a  subobject  classifier. 
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In  a  topos,  an  arrow  is  an  iscmorphisn  iff  it  is  both  an  epi 
and  a  mono. 

(xiv)  Some  basic  properties  of  functors  and  related  concepts. 

(A)  Let  c  ,  3  be  two  categories.  A  functor  9  :  c  -*  3 

consists  of  two  maps 

(i)  9  :  Ob  (C  )  -♦  0b(3 ) 

(ii)  9  :  Ar(C )  -►  Ar(C) 

such  that  either 

(a)  For  all  f,g  «  Ar(C}  ,  a  e  0b(C) 

S(9(f))  =  9(S(f))  .  B  (3=  ( f )  )  =  9(B(f)}  , 

If  S(f)  =  B(g)  , 

»( f°g)  -  *(f)  °  *(s)  . 


in  which  case  9  is  said  to  be  a  covariant  functor; 
or 


(b)  For  all  f,g  <  Ar(C)  ,  a  e  0b(C)  , 

S(9(f))  =  9 (B( f ) ) «  B(9(f))  =  9(S(f)) 

If  S(f)  =  B(g)  , 

9 ( fog)  =  3(g)  o  F ( f ) 

3(!da)  =  Id,(tf)  . 

in  which  case  9  Is  said  to  be  a  contravar I  ant  functor, 

(B)  Let  f  :  c  3  be  a  functor.  Then: 

(1)  9  is  full  iff,  by  definition,  for  all  a,p  e  Ob(c)  , 


5(Ara,/J(C))  §  {?(f)  !  f  €  Ara,p{C)> 

“  Ar9<a),9(/5)<*>  * 


(2)  F  is  faithful  iff,  by  definition,  for  all 

f,g  e  Ar(c)  ,  if  f  *  g  ,  then  9(f)  *  9(g)  ,  i.e., 
9  :  Ar (c )  -♦  Ar(3)  Is  Injective. 


(C)  9  Is  representative  iff,  by  definition,  for  all 

a  «  Ob (3 )  ,  there  is  some  p  «  0b(c)  such  that  a  Is  isomorphic  to 
9(p)  . 


(D)  9  is  an  equivalence  between  c  and  3  (or  C  and  3 

are  equivalent  w.r.t.  9)  Iff,  by  definition,  9  is  full,  faithful 

and  representative.  iff  (a  theorem  -  see,  e.g.,  MacLane  [165]  ,  p. 

01)  there  is  a  functor  $  :  s  -*  c  and  natural  isomorphisms  between 

g«3  :  c  -♦  c  and  Id  :  c  -♦  c  and  natural  isomorphisms  between 

c 

3og  :  3  -»  3  and  Id^  :  3  -•  3  .  In  particular,  if,  more  strongly, 
go?  =  Id  and  9og  =  id  then  we  say  9  is  an  isomorphism  between 

C  3 

c  and  3  ,  where  natural  transforms  and  natural  isomorphisms  are 

defined  below: 


Let  c  ,  3  be  two  categories. 
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r 

T 

t 


Let 

:  Ob(C ) 
(a)  *  Ar 


« 


Ar 


o-.P 


y,g  :  c  -*  a  be  two  functors.  Then  the  mapping 
-*  Ar(a)  such  that  for  all  a  «  Ob(C)  , 

\  mi  >{*)  such  that  for  all  p  e  Ob(C)  and  all 

"  )  IB  ) 

(C)  ,  the  following  diagram  is  commutative 


*<«)  g(a) 

|  5(f)  1  ?(f) 

»</>)  -njy*  S</»)  . 

and  we  denote  symbolically:  r  •.  9  — g  and  call  r  a  natural 

transform  from  9  to  g  .  If  r  9  — g  is  a  natural  transform 
from  functors  9  to  g  ,  where  9 ,  g  :  c  -♦  a  ,  for  categories  c  , 
a  ,  and  if  also,  for  each  a  e  Ob(C)  ,  r(a)  is  an  isomorphism  (as 
previously  defined),  we  say  r  is  a  natural  Isomorphism  from  9  to 
<3  • 


Summary  of  Isomorphism  concepts 

Let  c  ,  a  be  any  categories: 

(1)  A  given  a,p  e  ob(c)  are  isomorphic  iff  an  isomorphism 

f  «  Ar  -(C)  exists  iff  there  is  f  «  Ar  -(C)  ,  g  «  Ar„  (C )  such 
a  ,p  a  ,p  p  ,a 

that 


fog  «  id^  and  gof  *  Ida  . 

(2)  A  given  f,g  e  Ar(c)  are  isomorphic  iff  there  are  isomorphisms 
h,k  e  Ar(C)  such  that 

h°  f  ■  go  k  , 

in  which  case  where  well-defined  fog  is  an  isomorphism;  Ida  is 

always  an  isomorphism,  for  all  a  e  Ob(C)  ;  if  f  €  Ara  ^(C)  is  an 

isomorphism  between  a  and  p  *  Ob(C)  then  so  Is 
9{t)  «  Ar  9(aj  9  (/j)(®)  '  for  any  functor  9  :  c  a  . 

(3)  Let  »,g  :  C  -♦  a  be  two  functors.  The  r  :  9  — — ♦  g  is  a 

natural  Isomorphism  iff  there  Is  a  o  :  g  ►  9  ,  natural 

transformation  such  that  o«r  *  Id?  and  too  =  Id^  ,  iff,  by 

definition,  9  and  g  are  isomorphic  (relative  to  r  or  to  a  - 

both  o  and  r  being  natural  isomorphisms). 

(4)  Let  c  and  a  be  two  given  categories.  Then 

(i)  c  and  a  are  isomorphic  iff  there  are  functors  9  :  c  -+ 

a  ,  g  :  a  -*  c  such  that  g  =  lda  ,  g®s  =  Idc  . 

(ii)  c  and  a  are  equivalent  Iff  there  are  functors 

9  :C-*a  ,  g  :S-»C  such  that  5®g  Is  isomorphic  (via  a  natural 
isomorphisms)  to  Id9  and  g°?  is  isomorphic  to  Idc  . 
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Let  5 

:  C  -*  S 

and  $ 

:  3  -*  c  be  two 

functors  for  categories 

c 

and  3  . 

Then  we  say  5 

is  Je/*t 

adjoint  to  Q  and  3  is  right  adjoint 

to 

9  iff. 

by  definition; 

(  )  .  (  )  are 

bijections  ,  where 

(i) 

For  all 

a  « 

Ob(c )  ,  t  ,q  e 

0b(3)  and  all  f  e 

Ar 

?<a),7(a>) 

'  h  €  Arp,alC>  ' 

k  «  Ar  (3)  , 

7  /  v 

9(P) 

?(h) 

-4  5(a)  - 

— L_ *  ,  k 

—  n 

9 

3 

3 

* 

g(,)  9(k)  ,  <3  (//  ) 


(foff(h))*  =  f*«h 
(kof)*  =*  q  (k)«f  *  , 


f  being  the  right  adjoint  of  f  , 


(ii)  Equivalently,  for  all  a  ,/J  «  Ob(C)  ,  i  ,rj  s  0b(3)  ,  and  all 
9  «  Ara,3(,)(C)  '  h  €  Ar/5,a(C>  '  k  €  Ar,.9(*>  ' 


*(/»)  — (h)-"  *(«) 


g(7)  ..g(k)  >  g  (J7  ) 


(S(k)og)  =  k°  g 
(g=>h)  =  *g°5(h) 

Note  that  for  any  f  e  Ar?(a)  •»(*)  and  any  9  €  ArQ  ^  ,(C)  . 

as  above , 

*  *  *  * 

(f  )  =  f  ;  (  g)  =  g  . 
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Appendix  2 

Deduction  Categories  and  Foraal  Topos  Properties 

Let  (c,»)  be  a  deduction  category,  with  the  usual  notation 

c  =  (Ob(C ) ,Ar(c ) )  ,  Ar(C )  =  u  Ar,  .(c)  ,  the  arrow  class, 

i ,  j«Ob(C  ) 

and  Ob(C)  ,  the  object  class.  We  assume  the  initial  object 
0  e  Ob (c )  . 

In  addition,  define  the  a -relation  (object)  class 

Rel (c ,a)  5  u  a(i)  s  Ob(Preord  ) 
ieOb(C) 

the  a -relation  arrow  class, 

ArRel (c ,a )  =  u  a(f)  s  Ar(Preord  ) 
ffiAr(c) 

and  the  signature  (object)  mapping 
o  :  Rel (c  ,a)  Ob(c  )  , 

where  for  any  r  c  Rel(C,a)  ,  o(r)  «  Ob(c)  is  that  unique  object 
such  that 


r  «  a(o (r) )  . 

In  particular,  note  the  special  case  (assuming  c  has  all 
finite  limits  and  products)  for  (c ,Sub)  ,  and  further  (SET, Sub)  , 
where  indeed  any  relation  r  is  such  that 

r  «  Sub(o(r))  *  »(o(r))  , 


i .  e.  , 


r  s  o (r)  . 

That  is,  a  relation  is  simply  a  subset  of  a  given  set.  For 
example,  let  X,Y  «  Ob(SET)  .  Thus,  X  x  Y  e  Ob(SET)  .  Let 
r  s  X  x  Y  be  arbitrary  fixed  (binary)  relation  relative  to  X  and 
Y  .  Thus,  o(r)  *  X  x  Y  here. 

Note  also  as  usual,  for  any  i,j  «  Ob(c)  and,  say, 
f  e  Ar^  j<c)  <  *(£)  *  Ar*(j)  »( i ) (Preord^ )  and  ^-relations 

r,s  «  *(j)  such  that  r  Sa(j)s  «  a(f)(r)  *(f)(s)  ;  a(f)(r)  , 

m(f ) (s)  «  a(i)  . 

Also  consider  the  following  basic  results: 

Cat(c,a)  S,B  .  Ar  ( c  )  Ob(c)  , 

Id  :  Ob(C)  -♦  Ar  (c )  , 

o  :  Ar ( c ) gXgAr ( c )  =  {(f,g)  |  f,g  «  Ar(c)  and 

S(f)  =  B ( g ) )  -  Ar (c )  , 


where  S 

is 

the 

source  or  domain  map 

B 

is 

the 

codoaaln  (s  range)  map  , 

Id 

is 

the 

identity  association 

map , 

O 

18 

the 

(binary)  composition 

map , 
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where  for  all  i,j  e  Ob(c)  ,  f  €  A^  ^(c) 

S ( f )  3  i  ,  B ( f )  =  j  .  ld{i)  e  Ar.(j(C)  , 
the  identity  arrow,  and  for  any  (f,g)  e  Ar(c)  x_Ar(C)  , 

O  ;  D 

°  (f.g)  =  f°g  .  arrow  composition. 


Basic  first  level  syntax  mles 

For  all  i  «  Ob(c)  ,  all  k  «  Ar(c)  ,  all 
(f,g)  e  Ar(c )sxBAr{C ) ,  (g,h)  e  Ar (c ) gxBAr (c )  , 


RT 

I , syn , 1 

RI ,syn, 2 

RI , syn , 3 
Rt 

I ,  syn ,  4 


S ( Id { i ) )  =  B ( Id ( i ) )  =  i  , 
k« (Id(S(k) ) )  =  Id(B(k) )  o  k  =  k  , 
S(fog)  =  S ( g)  ,  B(fog)  =  B(f)  , 

(  fog)oh  =  fo  (goh)  . 


X 

<,> 


:  Ob(C)  x  Ob{c)  -•  Ob(C)  (object)  product  map  , 
:  Ar (c )  X  Ar(c)  =  {(f,g)  |  f.g  «  Ar(c) 

s.s 

and  S ( f )  =  S { g ) } 


-*  Ar(c)  ,  arrow  product  map  , 

proj1  ,  proJ2  :  Ob(C)  x  Ob(C)  -♦  Ar(C)  ,  projection  map  ,  where  for 

all  f.g  «  Ar (c )  y  Ar(C)  ,  all  i,j  e  Ob(C)  , 

S.S 


RT 

I  ,syn,  5 

S ( <f , g> )  =  S ( f )  =  S(g) 

Rt 

I , syn, 6 

B(<f,g>)  =  x(B(f) ,B(g) )  3  B ( f )  x  B(g) 

RI,syn,7: 

S  (pro  jj  ( ix  j ) )  ■  S(proj2(ix  j)  )  =  i  x  j 

RI . syn, 8 ' 

B(pro jj ( lx j ) )  =  i,  B ( pro j  2 ( ix  j ) )  =  j 

RI , syn, 9 ' 

proJj(B(f )  ,B(g) )  °  <f,g>  =  f 

RI , syn, 10 ‘ 

pro j2 ( B( f ) , B ( g ) )  °  <f,g>  =  g 

R_  *  .  : 

If 

I ,  syn , 1 1 

B  (  f )  -  i  x  j 

then 


<Pro  j  1  (  i  ,  j  )  o  f,  pro  j  2  (  i  ,  j  )  °  f>  =  f  . 

Define  also 

x  :  Ar(C)  x  Ar(C)  -»  Ar(C) 

unres trie  ted  arrow  product  map,  where,  for  all  f.g  <=  Ar(C)  , 
f  x  g  »  <f  o  proj1(S(f ) ,S(g) ) ,  g«pro J2 ( S ( f ) , S ( g ) ) >  . 
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Next,  define: 

•[••]  e  Subst  :  Rel(C.a)  x  Ar(C)  Rel(c,a)  , 

o,B 

the  substitution  mapping,  where  for  any  (r,f)  «  Rel(c,a)  x 

o  , 

recalling  r  «  a(o(r)),  o(r)  €  Ob(C)  , 

T(r,f)  S  r[f ] 

=  (a( f ) ) (r)  «  a(S( f ) )  £  Rel(c.a) 

3  substitution  of  f  into  r  . 


The  following  properties  hold: 

For  (r,f)  «  Rel(c,a)  x  Ar(c)  and  all  (f,g)  «  Ar{c)  x 

o  ,B  B, 


RI ,syn, 12 
RI,syn, 13 
RI , syn, 14 


o(r[f])  =  S(f)  , 

(r[f ] ) [g]  =  r [ fog]  , 
r[Id(o (r) ) ]  *  r 


Also,  for  any  r,s  e  Rel(c,a)  , 


I , syn, 15 


then 


If 


r  *a(o(r))8  ' 
°(r)  *ob°{8)  ’ 


R 


I ,  Th ,  0  • 


and  if 


If 


( r , f ) , (s , f )  e  Rel(c,a)  x  Ar(c)  , 

a  ,  B 


r  Sa(o(r) )s 


then,  by  the  order  preserving  property. 


i  ,e.  , 


) (r)  *a(o(r) )*(f * (s’ 


r[f]  *a<a(r))8[f]  * 


In  particular,  note  the  special  case  a  =  Sub  (and  c 
all  finite  limits  and  products).  Furthermore,  for  the 
(SET, Sub)  ,  we  note  now  that  for  all 
( r , f )  «  Rel ( SET , Sub )  x  Ar ( SET )  : 

o  ,B 

r  «  Sub(o(r))  =  a(o(r))  , 


r  £  o ( r )  «  0b( SET)  , 
r ( f ]  =  f-1(r)  £  B ( f )  «  Ob ( SET )  . 


Ar (C  )  , 


Ar (c  )  , 
s 


having 
case  of 


i  .  e  .  , 
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Also,  note  the  terminal  arrow  maps 
pa  :  Ob { C  )  -  Ar  (C  ) 

where  for  each  i  «  Ob(c)  ,  {p  !i) )  =  Ar,  ,  (c)  ,  where  1 

a  i » i  a 

a 

terminal  object  in  Ob(C)  ,  and  hence 
R 


VI , syn, 16 :  S(pa(i))  =obi  .  B ( Pa { i ) > 

and  for  all  f  «  Ar(c)  with  B(f)  •  1 


sOb1a 


RI , syn, 17 :  f  _Arpa(S(f))  * 

Note  also  that  for  all  r,s  t  e  Rel(C,a), 
R 


I ,  Th ,  1 


I , Th , 2 ' 


r  ^afofr) )r 


{indeed,  r  =a(o(r))r) 


If 


then 

and 


ria(o(r))S  and  SS(o(s))t 


o(r)  =  o(s)  -  oft) 


r  S*(o(r) )to 


(Modus  Ponens)  . 

In  addition,  we  may  have: 

Loc(C.a)  T  :  Ob(c )  -»  Rel(C,a)  ,  truth  mapping 

where  for  all  i  «  Ob(C)  ,  all  f  «  Ar(C)  and  all  r  €  Rel(C 


I , syn , 18 ' 


I , Th , 3 ' 


I , syn , 19 ' 


o(T(i))  =obi  , 


r  *a(o(r))T(o(r))  ' 


T(B(f))[f]  -obT(S(f))  . 


&  =  a  :  Rel(c,a)  x  Rel(c,a)  -*  Rel(e,a) 
a  ,o 

conjunction  mapping,  where,  for  all  r,s,t  e  Rel(C,a)  ,  f  t 
with  o(r)  =  o(s)  =  o(t)  =  B(f)  , 


a  ( r ,  s )  =  r  a  s  , 


where 


I , syn, 20 ' 


I , syn ,21' 


o(tas)  =o(r)  =o(d)  , 
(r  a  s ) { f ]  =  rff]  a  s(f] 


is  any 


,») 


Ar(C  ) 
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RI,Th,4,5:  If 


r  s 


'*(o<r)) 


s  , 


then 


t  a  r,  r  a  t  s_  .  ,  .  s  . 

a(o (r) 


I,Th,6; 


If 


r  S*(o ( r ) )8 ’ ^ 


then 
r  s 


*{o{r) ) 


S  A  t 


l  :  Ob  ( c )  -♦  Rel(C,»)  ,  raise  mapping  where  for  all  i  e  ob(C) 
all  f  e  Ar(C)  ,  and  all  r  «  Rel(c,9t)  , 


R 


I , syn, 22 : 


*I,Th,7: 


I , syn, 23 ‘ 


o(l(i))  -  i 
i(°(r) )  *a(o(r))r 
KB(f))[f]  =  i(s<f))  . 


or  v  ;  Rel  (C  ,» )  x  Rel(C,3fc)  -♦  Rel(C,&)  ,  disjunction  mapping, 
o,o 

"bore,  for  all  r,s,t  e  Rei(C,»)  ,  f  «  Ar/Ci  with 
o(r)  =  o(s)  =  o(t)  =  B(f)  , 


(r,s)  ^  r 


v  s 


where 


I , syn , 24 


I , syn, 25 : 


o(r  v  S)  =  o(r)  =  o(s)  . 

(r  v  s)ff]  =  r[f]  v  s[fj  . 


RI,Th,9,lO:  If 


then 


r  **(°(r) )8  • 


r  5*{o(r) )8  v  t  ,  t  v  s  . 


RI , Th , 8 :  lf 


r  A  S  ,  r  A  t  S  ,  ,  ,,u 

a(o{r) ) 


then 
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r  a  (s  v  t)  s 


a{o  (r) ) 


(distributivity)  . 

»  :  Rel(C,»)  x  Rel(c,a)  -♦  Rel(C,a)  ,  Implication  mapping  is 

o  ,a 

defined  to  be  such  that  for  all  r  «  Rel(C.a)  and  the  mapping 

id_ ,  ,  a  r  :  a(o(r>)  -  a(o(r))  , 
a{o ( r) ) 

where  for  all  s  e  a(o(r))  , 

<lda(o(r))  A  r)(8)  *  8  A  r  ' 

(=*)  (r,s)  ^  r  =»  s 

3  (ld»(o(r))  A  r)"('1  ' 

where  the  following  diagram  commutes: 


f  id*/  ,  ,,Ar 

a(o  (r)  )  -  ( °  - *  *(o(r)) 


a(o  (r) ) 


a(o (r) ) 

(f)  I  I  »<*> 

a(S( f ) )  a(f  >  (g)=»a(f )  a(S(f ) ) 


O  (r) 


We  can  show,  for  all  r,s,t,u  «  Rel(C,a)  and  f  e  Ar(c)  with 
o(r)  =>  o(s)  =  o(t)  =  o(u)  =  B  ( f )  ,  corresponding  to  the  diagram 


*1 , syn, 26  * 
*1 , syn, 27 ‘ 


o(r  ^  s)  =  o(r)  ■  o (s)  , 

(r  9  s)[f]  ■  r[f]  »  s[f]  . 


In  addition,  we  require: 


RI , Th , 1 1 :  If 


r  *a(o (r) ) 8  and  ~a (o  ( r )  )u  ' 


(r  a  (s  »  t))  =sa(o(r))u  . 


RI,Th,12:  If 


then 


(r  a  s)  sa(o(r) jt 


r  *a(o<r))(s  *  t)  ' 
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Quan(C,a)  v  ;  (Ob(a)  x  ob(a))  x  Rel(e,a)  -♦  Rel(c,a)  universal 

(Id, a) 

quantifier  mapping,  where  for  all  i,j  «  Ob(c)  ,  for  the  r.  apping 
a(proj2(i,  j)  )  :  a(j)  -♦  a(lxj)  , 

and  all  r  e  Rel(C,a)  with  o(r)  =  i  x  j  (i.e.,  r  «  a(i  x  j  )  )  , 

v(i,j,r)  &  (tt(pro j 2 1 i , j ) ) )  (r)  , 

where  the  following  diagram  commutes: 

r  a ( pro j  ( ix  j  )  ) 

[  a(j)  - - - .  a(ixj) 

I  I 


a(ixj) 
a ( id( i )xf ) 

a( ixk) 


v  ( i  ,  j  ) 


v(i,k) 


a(  j) 

|  *(f) 

a(k) 


k 


We  can  show,  for  all  i,j,k  «  Ob(c)  ,  all  r,s,t  «  Rel(c,a)  , 
all  f  e  Arfc  j ( C )  ,  all  g  «  Arixj  k<c!  '  =  1  x  j  < 

o(s)»kxj  ,o(t)=J  ,  corresponding  to  the  diagram 


RI ,syn, 28: 

RI , syn , 29 ‘ 

In  addition, 


°  ( v  (  i  ,  j  ,  )  )  *  j 

v ( i , j , r ) [f ]  =  V ( i ,k, r [ Id ( i )xf ] )  . 

we  require 


If 

RI ,Th, 13 ; 


s [ <g , pro J 2 ( i , j ) > ]  s*(o(r))r  - 


then 


v(k, j.r) [proj2(i, j) ]  sa(o(r)) 


If 

RI ,Th, 14  : 


t [ pro j  2 ( ix  j ) ]  ^a{o(r))r  , 


then 


3 


sa(o(w)  )v(1,  J'r) 


(Ob(c )  x  Ob ( c ) )  x  Rel  (c ,a  ) 
•  Id  ,o 


Rel (c ,a) 


existential  quantifier  mapping,  is  defined  analogous  to  v  ,  as,  for 
all  i,j  «  Ob ( C )  ,  r  «  Rel(C.a)  with  o(r)  =  i  x  j  , 
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3(1. j.r)  *  (*(proJ2(lxj)))(r)  . 

with  associated  commutlvity  of  diagram,  implying 


I , syn, 30 ‘ 


I , syn, 31 ' 


o  ( 3  (  i  ,  j  ,  r )  )  *  j 

a ( i  ,  j , r ) ( f ]  =  3 (i,k,r[Id(i)xf ] )  , 


for  all  i,j,k  «  0b(C)  ,  all  r,s,t,u,v  e  Rel(c,a)  ,  all 
f  *  Ark,j(C3  •  a11  9  €  Arixj,k^c^  '  °(r)  =  1  x  J  -  o(s)  =  k  x  j  , 
o(t)  =  o  (u)  =  j  . 


RI,Th,15:  If 


r  Sa(o(r))t[pro;,2(lx:l)]  S(o(r))u[proj2(ixJ)] 


( 3 ( i , j , r )  a  t)  *a(0(r))  urproj2(ix j) ] 


RI,Th,16:  If 


r  Sa(o  (r)  )  t£<9'Pr°J2(lx;5)>3 


r  S(o(r)){3(i'J't)5[pr°J2(lxj)3  1 

Foun(C ,a)  x  :  Ob(c)  -*  Rel(c,a)  ,  the  equivalence  mapping  is  defined 
to  satisfy  the  usual; 

For  all  i ,«  0b(c )  , 


I , syn , 32 ' 


o(*(i))  =  i  x  1  . 


Also  define  the  associated  mapping 


:  0b(C )  x 


(Ar  (C  )  x  Ar  (C  )  -»  Rel  (C  ,*)  , 


id,Boproj1  ( S , B ) , ( S , B ) 


where  for  any  i,j  e  0b(C)  and  any  f,g  e  Ar^  A ( c ) 


f  xi  g  =  *1(f.g) 


=  x ( i) [<f ,g>]  , 


noting  here 


o(f  g)  =  j  . 


In  addition,  we  require,  for  all  i  e  0b(C)  , 

RI , Th , 1 7 :  T(i)  WId<i3  -i  Id<i> ) 


( ref lexivity ) 
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RI,Th, 18: 
RI,Th,19: 


(proj1{ixi)  «i  proj2(ixl)  -sa(ixl) 

(proj2(ixi)  proj:(ixi))  .  (symmetry) 


[(proj1(ixi)oproj1( (ixi)xi) )  «i  ( pro j2 ( ix i )opro j 1 ( ( ix i ) x i ) ) j 
^ - - - — - - >  \ _ _ _  _  , 

=  projj ( ixixi )  S  proj2(ixixi) 


A 


[ ( pro J2 ( ix i Joprojj ( ( ixi )x  1 ) )  * 

' - v - - 

=  pro j2 (ixixi ) 


( pro  j  2 ( (ixi)xi) j 

w - v - / 

=  pro j3 ( ix ix i ) 


SR(  ixixi  ) 


[(Pfoj1(ixi)oproj1((ixi)xi))  ( pro j 2 ( ( ix i )x 1 ) ] 

V - - — \ - w - - 1 

=  pro ( ix ixi )  =  pro ( ixix i ) 


( transitivity) 

Clearly  can  be  replaced  by  *a(()  ,  using  (T)  ,  etc. 

In  addition,  we  require  for  all  r  e  Rel(c.a)  with  o(r)  =  i  , 
all  f  e  Ar(i,j)  ,  all  i,j  e  Ob(C ) 

RI,Th,20:  ((projjdxi)  (proj2  ( Ixi  )  )  a  r[proj1  (  ixi  )  ] ) 

Sa(ixi)r[pro;32(ixi)  1  '  (substitution  for 

relations ) 

^I(Th,21"  ( ( pro j j ( ix i ) )  proJ2 ( ixi ) ) 

Sa(ixi)((fopro^(lxl))  (f°proj2(ixi)  ) 

(substitution  for  arrow 
composition;  also  the 
natural  generalization  of 
functional  extensions  by 
fuzzification) , 

( ( (projjUx  jxixj)  proj3(ix  jxix  j)  )a 

( (proj2(ix jxix j)  xj  proj4(ix jxixj)  )  ) 


Sa(ixjxixj)(ProJi2(ixJxlxJ)  “ix j  Pr°J34(ix^^3))  • 
Note  also  the  definition,  for  all  i,j  «  Ob(c)  , 

Proj^i)  S  id(l)  ,  Proj1(2(ixj)  =  Id(ixj)  ,  etc. 
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Need  also  to  consider: 


R  :  Ob(C)  -♦  Ob(C)  ,  exponential  mapping 

{•!••)  :  (Ob(c)  x  Ob(c)  x  Rel(c,a)  Ar(c)  , 

id  ,o 

class  {or  set)  abstraction  mapping, 

s  :  Ob(C)  -*  Rel(C,a)  ,  membership  mapping  with  basic 
properties . 

For  all  i,j,k ,t  e  ob(C)  and  all  r  e  Rel(C.a)  with 
o(r)  =  i  x  j  ,  all  f  e  Ar  .(C) 

Z  •  J 


RI,syn,33:  S^1  I  J-r>)  =  3  <  B({i  |  j,r})  =  n(i)  2  n1 


RI,syn,34:  -  i  *  «  . 

RI , syn , 35 :  (1  I  I  <,r[Id(i)  x  f ] )  . 

li  addition,  is  associated  the  mapping 

«  •  0b( C )  x  (Ar (C)  x  Ar (C ) )  .♦  Rel(c.a)  , 

(  id,BoProj1)  (S.tJoB)  f  (S,B)  ) 

where  for  any  i.j  «  Ob(C)  and  any  f  6  Ar,  .(c)  ,  g€  Ar  (c) 

1,3  i,nJ 


«j<f'3)  =  f  9  =  € ( j ) [<f ,g>3  «  *(i)  , 


i  .  e  .  , 


g)  =  i  . 

Also,  for  all  i,j,k  «  Ob(C)  ,  f,h  e  Ar(c)  , 
r,s,t,u  «  Rel  (c ,a)  ,  h  «  Ar^^^  ,  o(r)  =  ixj,o(s)=kxj, 

o(t)  =  k  x  j  ,  f ,  m  «  Ar  .(c)  ,  g  €  Ar.  ,{C)  o(u)  *»  j  . 

j.n 

RI ,Th, 22 :  If 


(r(<h,proj2(kx j)>]  sa(lxj)  h) 

then 


RI , Th , 23 : 


((h  €i  I  J'r>  °  Proj2(kxj))  sa(lx  h)  . 

If 


(t  *a(kx j)st<h-Proj2(kx j) >] ) 
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then 


t  S(kxj)s  ei((1  1  j'r} 


o  pro J2 (kx  j ) )  . 


(u[proj2(ixj)]  a  (proj1(ixj)  ( f°pro j2 ( ix j ) ) ) 

**(ixj) (pro^i(lx^>  € 4 (o°proj2 ( ix j ) ) ) 


and 


(nrnt-rtW*- j)  ]  A  (proj1(ixj)  *  ^  ( m«>  pro  j  2  ( lx  j  )  )  j 
S  (proj1(ixj)  « 1  (foproj2(ix j) ) ) 


then 


u  V1*1  • 

A  deduction  category  (C,a)  satisfying  properties 
Loc(c,3t)(T,  A»&,i=P,v  =  or,»)  ,  Quan(e,a)(v  ,  3)  , 

Foun(c,a)  (*,  a,  {-!••>,  « )  is  called  a  formal  topo s. 

It  follows  readily  (see  Coste  [41],  pp.  9,  10)  that  letting 
*  =  Sub  ,  (c,Sub)  is  a  formal  topos  Iff  c  is  a  topos . 

Note  the  following  specialization  as  an  example  of  a  formal 

topos 


(c ,a)  =  (Higg(H) ,Sub) 

Note  also,  e.g.,  for  (Fuz(H),Sub)  noting  Fuz(H)  is  not  topos, 
as  mentioned  before,  that,  e.g.,  RT  a  >  is  satisfied  for  a 

1 , AH ,4,0 

interpreted  as  ordinary  minimum:  (y  i  T  )  implies 

r  s 

(y  a  f  *=  y  a  i fir)  polntwise  over  X  ,  where 
t  r  r  v  s 

T  ,  y  ,  T.  :  X  -*  [0,1]  for  some  X  «  Ob(SEX)  .  (Abusing  notation, 
r  s  t 

x  =  o(r)  =0(8)  «  O(t)  .  ) 

Similarly,  a  interpreted  as  ordinary  minimum  satisfies 
RI,Th,6 

(y  if,  y.  )  implies  (y  i  r  a  y. )  point-wise  over  X  . 
r  s  r  r  s  t 

But  ,  a  interpreted  more  generally,  such  as  in  t-norm  form  <f>& 
will  not  satisfy  Rj  Th  g  since  i  min  =  a  pointwise  over 

[0,1]2  and  indeed  if 

y  =  y_  s  y.  ,  polntwise  over  X 

S  T  L 

then  choose  any  t-norm  <}>^  <  a  (such 
general , 


V 


<  y 


as  prod,  e.g.),  whence,  in 
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Similar  remarks  hold  for  some  of  the  rules  for  v  ,  v  ,  3  ,  =»  . 

Thus,  the  concept  of  a  formal  topos  must  be  modified  to  allow 
natural  evaluations  for  more  general  negations,  t-norms,  t-conorms, 
in  the  class  of  generalized  set  systems.  This  remains  at  present  as 
an  open  problem.  One  could  state  there  are  enough  difficulties  even 
with  Fuz(H)  and  =  a  =  min  ,  $or  =  v  =  max  ,  <J>  =  1  -  (•  )  , 

since  Fuz(H)  is  not  a  topos  and  hence  (Fuz(H),Sub)  is  not  a 
formal  topos,  since  Fuz(H)  -  although  a  category  and  hence 
(Fuz(H),Sub)  is  a  deduction  category  -  lacks  a  subobject  classifier 
n  .  (See,  e.g.,  Carrega  [29].) 

Nevertheless,  semantic  evaluation  can  be  carried  out  into  gen¬ 
eralized  set  systems,  but  the  soundness  and  completeness  property 
may  not  in  general  hold. 
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Appendix.  3 

Extensions  of  deductive  categories  ((41),  Chapter  III) 

This  appendix  establishes  a  category  structure,  extending 
Zadeh's  and  others'  "fuzzification"  or  generalization  of  ordinary 
sets  and  functions  and  relations  on  them. 

The  BSnabou  constructions  imbed  deduction  categories  within 
categories,  and  formal  topoi  within  topoi,  in  a  way  so  that  any  re¬ 
sults  Involving  deduction  categories  (or  formal  topoi)  may  be  -  in 
a  natural  way  -  re-interpreted  in  terms  of  (ordinary)  categories 
and  topoi.  The  latter  two  structures  have  received  more  attention 
in  the  literature  involving  formal  language  and  set  theory  (for 
example,  Johnstone  [127])  than  the  former.  Furthermore,  by  spe¬ 
cializing  these  constructions  over  (SET,H‘)  ,  the  categories  S-.o'H' 
and  Gog(H)  are  obtained  representing  generalizations  of  Zadeh's 
fuzzy  set  theory.  (See  section  2.4.)  In  addition,  Higgs'  and  other 
extensions  are  considered. 

In  the  following  results,  we  refer  to  deduction  categories 
all  having  syntax  structure  furnished  by  Th  plus  possible 

*  o 

additional  syntactic  structure,  depending  on  the  additional  pure 
theoretical  structure  required,  as  presented  in  2.2.2  (and  a  basic 
concrete  example  in  Appendix  2).  through  the  corresponding  axioms 
and  rules.  If  a  deduction  category  satisfies  "all"  additional 
structure,  it  is  a  formal  topoa  (i.e.,  T,  a,  1,  v,  >* ,  »,  3,  n, 
making  up  Foun(C,a),  Loc(C,a),  Quan(e.a));  if  it  satisfies  addi¬ 
tionally  all  but  n  (i.e.,  T,  a,  1 ,  v,  * ,  » ,  3)  ,  it  is  called  a 
formal  logos ;  if  it  only  satisfies  additionally  (T,  a,  *,  3)  ,  it 
is  called  a  formal  regular  category;  and  if  it  satisfies  addition¬ 
ally  merely  (T,  a,  * )  ,  it  Is  called  a  formal  finite  limit  category. 

It  follows  that  (extending  the  previous  result  at  the  end  of 
Appendix  2),  assuming  c  is  a  category  with  finite  limits  exist¬ 
ing  : 


(C , Sub) 
category; 

is  a  formal  regular 

category 

iff 

(C , Sub) 

is  a  formal  logos  iff 

c  is  a 

logos ; 

(c , Sub) 

is  a  formal  topos  Iff 

c  is  a 

topos . 

(i)  BSnabou 

type  I  construction 

is  a  regular 


Let  (c,a)  be  a  deduction  category  with  additional  property 
Then  define  category  y^(c,st)  as  follows: 

obfy^e.a))  3  <(i,T)  I  1  €  Ob(c),  y  e  a<i)>  a 

Ar(f  (c,a))  =  u  Ar  (y  (c.a)) 

(i,*),  (j.e) 

( (i,*)  ,  ( j,9 )«  Ob(rx(c ,a) ) 

(1  ,r).  (3,8)  €  ob(y x (c ,a) )  , 


where  for  any 


276 


Goodman  and  Mguyen 


Ar 

(i,T).(3,9) 


I'' 


,*))  “  {((f)) 


T  ,e 


|  f  «  Ar,  , (C )  and  f 
a  >  J 

satisfies  condition  { G1 ) )  , 


where 

(Bl) 

T  *a ( i )  8 [ f  1  - 

and  where 

((  ))fi( 

is  an  equivalence 

relation  (depending 

i , j )  upon 

Ari(j(c] 

\  ,  so  that  for  any 

f ,g  e  Ar ^  ^ (C  )  , 

Uf))r>9  = 

l(g>>*.a 

iff 

(B2)  r  **(1)  • 

Composition  °  is  defined  by,  for  any 
( 1  )  .  (j,«)  ,  (k.A)  «  Ob{yi(c,a))  ,  f  e  A^ (j(C)  ,  he  Ar  fc(c)  , 

ho  f  «  Ar  A  ^ { C }  ,  and 

((h)), ,,•((()!,  ,  «  (<h»f)>r  A 

is  well-defined,  since 
8  **(1)  A[h] 

implies  ,  by  RT  Th  Q  (substitution)  and  Rj  gyn  13 
»[f]  *a(i)  AthHf]  =a(i}  A[hof] 

and  since 


'  Sid  “<fi 


by  RIiTh,2  (modus  ponens) 


y  *a(i) 


Similarly,  we  can  show  the  above  holds  for  any  h  replacing 
h1  ,  f‘  replacing  f  satisfying  analogous  of  (B2)  ,  etc.,  as 
well  as  associativity. 

the  identity  arrow  Id,,  is 


For  each  (i,T)  e  Obfy^c,®)) 


merely  ((Idj))^  ^ 


where  Id, 


(l.r) 

is  the  identity  arrow  for  i  in  c 


r  = 


a(i)  ?[IdiJ  '  bV  Ri ,Syn, 14 


Theorem  1 

Let  (c,a) 


be  a  formal  finite  limit  category.  Then  y  (e,a) 
is  a  category  with  finite  limits,  ( ^ T ( c ,a) ,Sub)  is  a  formal  finite 
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limit  category,  and  there  is  a  universal  arrow  which  is  order  pre¬ 
serving  ,  etc . , 

(F(c,*)'  a(c,%))  €  Ar(c  ,a)  ,  (J’I(c  ,»)  ,sub)  (DeCat)  ' 
where  F , .  _ ,  :  c  -*  F  (e.at)  is  the  faithful  functor  determined  by 

(a)  For  all  i  «  Ob{C)  , 

F  a)(l)  3  (l.T(l))  «  ob(»l(e,*))  . 

(b)  For  all  i.j  e  Ob(C)  ,  f  e  Ari;J(c)  , 

F(c,a)(f)  =  (  (f)  ,T(i)  ,T(  j)  €  ArP(Cia)(i).F(ca)(j)(:fI(C'a))  ' 


and  where  the  natural  transform  a,.  ^ .  :  a 

(c  ,a) 


Sub«>Fjc  is  given 


i  .  e .  , 


For  all  i  «  ob(C )  , 


“(c.*)11’  e  Ar*(i),Sub.F(<.  ■ 


where,  for  all  r  «  a(i)  ,  noting  that  F  T(i)  ,  and  hence, 

for  (Id,  :  i  -•  i)  the  identity  arrow  on  i  in  Ar(C)  ,  we  have 

a(c ,a) (i) (r)  =  ((Idi))f,T(i)  €  Ar(i,F) , (i,T(i) ) (yi(C ’a) 5  • 

and  hence  a  (C  (9l )  ( 1 )  (y  )  6  Sub°F(c  a } ( i )  4 

(See  Theorem  2  for  the  definition  of  the  term  "universal".) 


(11)  Benabon 


II  construct ion 


Now  consider  a  similar  -  but  differently  appearing  construction 
-  for  (c ,*)  being  a  formal  topos.  The  connection  between  type  I 
and  type  II  constructions  will  be  given  in  part  (ill)  : 

Let  (c,a)  be  a  formal  topos.  Define  category  as 


follows : 


ob(,iI(c.*))  =  ob(y 2 (c ,a) ) 


ArJ/jjfC.a)  = 


(i.F)  ,  (  j  ,«  )«Ob(y  I;r  (c  ,»)  ) 


Ar(l,F)  .  (  j,«  )  (iriI(C  '*>  }  ' 
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where  for  any  (i,*)  .  (j,«)  €  Ob(y (e ,a) )  , 

Ar ( i . r ) ,  ( j,e) (yIIfc'a>  > 

3  {(((A)))  |  a  €  »(ixj)  and  A  satisfies  conditions 

(FI),  ( F2 ) ,  ( F3 ) }  . 

(FI)  A  sa(lxJ)  ?[proJ1(ix j) ] ,  ® [projj(ix j) ] 

(F2)  *'fproj1(ixj)]  =a(lx;j)  (3k)  (ix  j,A  [projix;.  (kxix  j) ) ) 

(F3)  A  (projlx;j(ix  jxk)  ]  a  A  [projlx^(ixkx  j)  ] 

^sifixjxlc)  (pr°3  j(ix3xk>  pro  j  ^  (  ixkx  j  )  ) 

(abusing  notation)  . 

In  individual  variable  form,  equivalently,  we  have,  for  all 


x<*>  , 

yd)  ,  W(J)  ,  ,(k) 

(FI) 

a (<x( ,y( ^  5  >]  s 

nx(1)).  e[y(j)3 

1 

(F2) 

y(<x(i), y(j)>l  = 

(3z(k) ) (A (z(k) , X 

m  ,y( 

( F3  ) 

a  [<x(1)  ,y<;?)>]  A 

t\\  Ml 

A[<x'"',wv-">3  s 

,yUl 

where 

s  is  sa( • )  ‘ 

(((A)))-.  is  to  be  interpreted  as  the  equivalence  class  of 

r  ,  9 

a  ,  i .e. , 


all  A •  €  %{ ix j ) 

such  that 

A  '  ”*(ixj) 

A‘  **(i) 

a  and 

A  sa<i)  A'  • 

Composition 

o  for 

Ar(y n (c > 

arrows  (((A)))  «  Ar(i,y)(j,9)(:riI*C'a))  and 

( ( (P)  ) )° { ( (A  )  ) )  =  (((m°a)))  ,  where  noting  p  €  s(ixj)  ,  A  «  a(jxk) 

^°A  =  (3  J  )  (  i  X  k,  Mprojj^Ux  jxk)  3  a  a  [pro  jlx  j  (  ix  jxk)  )  ) 

=  (3  y^  ^  )  (a  (x*  ,y^  ^  )  a  p{  y*^(z^'))  e  a(i  x  k)  , 

and  where  for  any  (  i  ,  tp  )  €  0b(  *  j  ^  (c  ,» )  )  ,  the  identity  arrow  is 


Id 

4 


(i.v-) 


-  (  (  (if  [  pro  j  (ixi)  a  r  [  pro  j  (ixi)]  a  (proj  (lxi)  *  proj  (ixi))  ))) 
(l)i  (2)1  U)1  (2)1 


=  (((?(x(1))  a  riy1'1)  a  (x 


(1) 


.(1) 


,(D 


)  ))) 
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Theorem  2 

Let  (e,»)  be  a  formal  topos.  Then  f  is  a  topos, 

<y  (C ,a) ,Sub)  is  a  formal  topos,  and  there  exists  a  faithful 
(order  preserving,  etc.)  universal  arrow 


(G(c ,%) ’p(c  ,*) )  e  Ar(c ,a) , (y IX(c ,a) ,sub) <DeCat) 


where  G,.  .  :  C  -♦  ?TT|C,»)  is  the  faithful  functor  determined  by: 

(a)  For  all  i  e  Ob(c )  , 

G(e  m)(i>  -  (i.T(i))  e  0b( y TT (c ,» ) )  , 

(b)  For  all  i.j  e  Ob(C )  ,  f  e  Ar  ^  ^(C)  , 


d 


6 


(((proj  (ixj)  (foprojA(ix j) )  ))) 

(  (  ( y ( *  *  j  f(x(i))  ))) 


'(C  .a) 


(!)  ,G 


(c  ,a) 


(j)(yil(C'a>) 


and  where  the  natural  transform 


:  *  ”*  Sub°G(c,a)  is  9iven  as  : 

For  all  1  e  Ob(C )  , 

V.*)11’  *  ir!,(i),SuboO(c  ,,)U)(PrEOr,?!  ■ 

I  ^  , 

/»(c.a)(i)  :  a(1)  -  Sub^G(c  ,a)  ^ 1  ^  ' 
where  for  all  r  «  »{i)  , 

^(C ,a) (*)<*) 

-  (  (  (F  [  pro  j  (ixi)]  a  ( pro  j  (ixi)  *  proj  (ixi))  ))) 

( 1 )  i  ( 1 )  i  (  2 )  i 

€  Ar(i,V’)  ,  (  i  ,  T  (  i  )  )  (yiI(C  ,a)  )  ' 
and  hence, 

*«:.», <i><r>  e  Sub(G(c  #a)  (i) )  . 

By  (G,-  n  \  )  being  universal  we  mean  that  for  all 

.  \  ^  l  ^  J 

(C1,*1)  6  DeCat  and  each  (G',/3')  e  Ar  (DeCat)  , 

(c ,*) , (c • .*■ ) 

I 
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there  is  a  unique  (G",/J")  e  Ar  (DeCat)  such 

{S-II(C  ,%)  ,Sub)  ,  (C  *  ,%•  ) 

that  the  followng  diagram  commutes: 


(c • ,a- ) 


Remark  1 . 

Note  that  if  (C ,»)  is  a  formal  regular  deduction  category  or, 
further,  a  formal  logos,  then  the  construction  r  (c,a)  remains 

valid.  In  turn.  Theorem  2  holds  with  the  appropriate  modifications 
("formal  regular  deduction  category"  and  “regular  category",  used  in 
place  of  "formal  topos"  and  "topos",  respectively,  for  example). 


(lii)  Connection  between 


constructions 


The  following  result  shows  that  for  ( C  ,  st  >  a  formal  topos, 
type  I  constructions  may  be  considered  a  subclass  of  type  II 


Theorem  3. 

Let  (C,a)  be  a  formal  topos.  Then  there  is  a  faithful 
functor  (i.e.,  a  one-to-one  injection  which  is  a  functor) 
g  :  f  ^(0  ,s)  -*  *  (c,»)  .  More  specifically,  we  may  choose  g  as 

follows : 

For  any  (i,r),  (j,e)  «  Ob(rI(c,a))  ,  and 
<  <  f ) >  v  ,  a  €  Ar(i,t) , ( J,e ) (yi(c  where  f  e  A^  j(c)  , 

g((i,T))  =  ( i ,r )  ;  g ( <  j  ,  ) )  =  (j,&)  , 

S(((f))y_8)  ^  (((g(f)))),|9  . 

where 


yU), 
yix  j 


f  1 

lXix jJ 


8  r v*^ H 

lyixjJ 


recalling  the  notation 


,  -t  My  n 

■  J  I  '  J  / 


„U)  d  ,  (4  .. 

2  ix  j  - 


Proof : 

First,  it  follows  easily  that  for  any  fixed 
(i,T),  (j,9)  €  Ob(^I(c,a))  and  any  f  «  Ari  j(c)  .  that  g(f) 

satisfies  properties  (FI),  (F2),  (F3)  iff  f  satisfies  property 

(  B 1  )  . 
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Next,  we  can  show  9  is  one-to-one  w.r.t.  ((  ))  and 

(  (  (  )  )  )  : 

Noting  that  since  is  a  formal  topos,  the  followina 

axioms  and  rules  invoked  are  valid: 

Let  f,g  «  ArA  j (C )  with  ((f)), ((g))  e  Ar{ A ,  ( j  ,  g > (y 1 <c ) ) ■ 

and  suppose  <3  ( f )  ^ag(g)  .  Thus,  trivially,  g(f)  £a  g(g)  and  by 

(  4  V 

use  of  the  substitution  rule  (R  )  ,  replacing  y ' J '  by 

X  f  1  il  i  v  XX  J 

g®x|^l  everywhere  yields  (using  now  simplified  notation) 

J 

((f(x)  «  f  (x)  )  Air  [x]a8  [f  (x)  ] )  (  (f  (x)  -  g(x)  )av[x]a0  [f  (x)  ] )  . 

But  since  (by  Rj  Th  17  and  the  definition  of  T) 

T  -  (f  (x)  *  f  (y.)  ) 

and  hence  by  R^  ^  and  Rj  Th  for  any  k  e  st(ixj)  , 

K  A  T  K 

St 

and  by  (6) ,  since  k  k  and  k  s  T  , 

*  *31  *  A  T  » 

and  hence 

k  =  *  a  T  .  . 

Thus  , 

(f(x)  «  f  (X)  )A1P(X]A0  [f  (X)  ]  =a  T  [x]A0  [  f  (x)  ] 

and  hence  by  R Th  2  (modus  ponens)  , 

ir  [x]  a8  [  f  (x)  ]  (  f  ( x )  *  g(x)  )Ay[x]A0  [f  (x)  )  . 

Then  by  use  of  Rj  5  and  R^  2  (modus  ponens)  we  obtain 

(a)  y[x]  a  9  [ f ( x ) ]  s%  (f(x)  *  g(x))  . 

Now  note  the  following  theorem. 

For  any  *  a  «  st(ixj)  , 

K  A  iff  K=a*AA 

(See  Theorem  1  (ii),  section  2.2.2  (VI).) 

Now  sin^e  ((f))  «  Ar(1  r)  g  ^  (f2(c  ,*)  )  ,  then  condition  (Bl) 

holds : 

y(x]  s  9 [ f ( x ) ]  . 

Hence  applying  the  above  theorem,  where 
«  =  y(x]  and  a  =  eff(x)]  ,  we  have 
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(b)  f [X]  =a  Y[x]  A  0 [f (X) ]  . 

Hence  combining  (a)  and  (b)  by  Ri  xh  2  '  we  ^ave 
P[x]  sa  (f (x)  *  g(x) )  , 

i.e.,  (B2)  is  satisfied  by  f  and  g  and  hence  by  definition 

( ( f ))vg  =  ((g))Vig  . 


( vi )  Higq's  construction  extended 

Let  (c,a)  be  a  deduction  category  with  additional  properties 
where  required,  beginning  with  a,  v,  X,  T  , 

Define  ym(c,a)  as  follows: 


(HI) 


ObC^ji  (C.a)  =  {(i  ,r)  |  i  e  Ob(c),  r  e  a(ixi)  with  f 

satisfying  (HI),  (H2)}  , 

r[<x(i) ,y(i)>]  -a(lxlxi)  r[<y(1).x(1)>] 


(i.e.,  Rj  Th  18  '  sYnune'trY<  holds  for  *  replaced  by  r )  , 

(H2)  ir(<x(1)  ,y(1)>]  a  r  [  <y ( 1 )  ,z{  1  )  >  ]  s  r  [<x( 1 } ,z( 1 } 

(i.e.,  Rj  Th  lg  ,  transitivity,  holds  for  *  replaced  by  r ) 


xU)  =  xj1]  .  ,  y 

ixixi  1 


U>  = 


y :  .  .  ,  z '  '  =  z 

1  ixixi 


U)  =  ,U) 

ixixi  ’ 


(y  HI  (c  >a)  )  u  Ar  .  i  .  .  .  .  (f  (c  ,a)  )  , 

fall  (i,r),(j,9)i  111 

«  Ob(yJII(c ,a) ) 


where  for  any  (i,r)  ,  <j,e)  e  ob(rni  (c  ,a)  )  , 


Ar(i,r),(j  e ) *yiII '* ^  =  (f  |  f  «  a(ixj)  and  f  satisfies 

(H3),  ( H4 ) ,  ( H5 ) ) 

(We  assume  *{•  )  -  equivalence  here  as  in  type  II  constructions.) 
Simplifying  notation  somewhat 


( H3 )  yt<x(1) ,y(1)>]  a  f [<y(i) ,z(i)>]  s  f [<x(i) ,z( J)>]  , 

f[<y(i)  ,z(^>]  a  0  [  <z(  D  ,w(  ^  5  >  ]  £  f  [<y(i)  ,w(^>]  , 

(i.e.,  Rj  20  ,  substitution  for  relations,  holds  for  f  and 
x  replaced  by  T ) 

( H4  f(<y(1)  ,z(:i)>]  a  f(<y(1)  ,w(j)>]  S  0  [<z(j)  ,w(j)>]  j 
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{ H5 ) 


f[<y(1),g>3)  =  DfrHy11'] 


fall  g  «  Ar(C ) 
=  j 


•] 


(i) 


-  *  [ <y  ^  )  ,y  ^  )  >  ]  , 


where  D(*)  «  Sk(i)  is  the  diagonal  part  of  V  . 

Composition  «  for  yni(c'a)  is  defined  by,  for  any  (i,if), 
(j,0),  (k,A)  «  Obfyjjjfc  ,a)  )  and  f  e  Ar  ( ±  f )  .  (  ^  g  }  (r  ZII  (C  ,a ) )  , 

g€  Ar( j ,9 ) , (k,A ) (ylll (c 'a) 1  ' 


g°f  =  ( g° f ) [ <x^ 1 ^ , z ^ > ] 

a  v 

fh  «  Ar  (c  )  ,  I 

lB(h)  =  j  J 


(f[<x(i),h>]  a  g[<h,z(k)>])  . 


For  each  ( i , tp  )  «  ob(rJII(c  »*)  )  -  Id  f  =  r  is  the  identity 
arrow  on  ( i ,r)  . 

It  easily  follows  that  for  any  (i,*)  «  Ob(fin(C,»)|  ,  (from 
(H2) )  , 

r  -  r[<x(1)  ,y(  1)>]  s  D(y)[x(i)]  a  D(ir)[y(1)]  and  (from 
(H4),  ( H5 )  )  for  any  f  e  Ar  (  (j,e)  (yIIj(C,*))  , 

f  -  f  [<x(i)  ,y(;3)>]  S  D(*)[x(1)]  a  D(»)[y(J)]  . 

( H5 )  are  equivalent  to  the  following 


and  in  particular,  for  any 


Note  that  (H2) 
composition  equations: 

*m(c,a)  is  a  category  under 

f  €  Ar( i ,r) , ( j ,« ) (yiII(C 'a) 1  ' 

Id , ,  «=  Id , j  „,of  =  t  ( ( H3 ) ) 

((HI),  ( H2 ) ) , 

together  with  (due  to  the  associativity  of  v  see  2.2.2,  VI), 


(i.n 


IdCM)°f  “  f 


Id(  i  ,ir )°  Id(  I ,? )  "  Id(  i  ,ir  ) 


fo  (goh)  «  (  fog)oh) 


all 


f,  g,  h  €  Ar(y ZIj (c ,*) ) 


well-defined,  the  usual  properties  for  arrows  in  a  category. 
In  addition, 

fop«f  s  r 

foD(9  )  =  U(T  ) 

where  (f°p  :  (j,9)  (I,?))  is  the  "opposite"  of  f  . 

For  any  (i,*”),  (j,9)  c  Ob  ( t  IIZ  (c  ,a )  )  ,  define: 


where 


IK*) 
(H  5) 
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xy,e  S  <f  I  f  «  Ari,j<c>  and  D(¥’>  S(i)D(0)[f3}  ' 

i.e  ,  £  «  gy  satisfies  (Gl),  and  hence  essentially,  g?  e  is 

the  same  as  Ar ( 4 f D (y , , , ( -j , D( „ > > j; <c ■* ) )  • 

Next,  define 

K,  ,e  =  (f  I  f  «  Ar^  j (C )  and 

*[<x(i) ,y(i)>]  s  « [<fox( 1 ^ , foy^ 1 J >] ) 
and  in  turn,  define 


{0(f)  |  f  e  3tr>#)  , 


where  here 


g  ( f )  =  f(foxli)  ,y(;j)  )  a  D(r)[x(i))  a  D(e)[y(;))]  . 

It  follows  immediately  that 

Ve  S  V*  £  Ari.j(C) 

and  by  using  (HI)  -  (H5)  , 

Sy,0  s  Ar(1  f  (  j  t0  )  (yI:rl(c  '*>  )  s  a(i*3)  • 

Now  define 

x  =  {(i,y)  |  (i,y)  e  0b(y (C ,a) )  and  there  is  some 

f  «  Ar(C)  with  B(f)  =  i  such  that 
D(V')[f)  =a(1)T(i)) 

the  class  of  all  normalized  (or  normable)  objects  of 

It  then  follows  that  for  any  (i,r)  and  (j,e)  e  x  , 

S3  *  3C..  „ 


and  hence 


0  -  v*  =  x*,*  =  Ari,j(C)  ' 

0  *  Sy.e  s  Ar(y,i)  ,  (a  ,  jj  (yin(c  >  s  a(ixJ) 


f  III,x(C  ,a)  “  (°b(yni,x(C  .a)  )  ,  Ar(yIII(X(c  >a)  )  )  is  a  well-defined 
subcategory  of  yIII(c,a))  where 


0b(yiII,x(C,9l))  "  *  ’ 


Symbolization  and  Evaluation  of  Language 


285 


Ar(:riII.Jf(C'*>> 


U 

over  all 
(i.n  .  (3.8  ) 
€  M 


Ar(i,y)  ,  ( 3.8  ) 


(*  JJ2 (c ,a) )  • 


Note  that  for  the  ordinary  Krbnecker  delta  <3  ( i )  «  a(ixi) 


where  for  all  f,g  « 

Ar(C) 

,  S(f) 

=  S(g) 

=  3 

G(f)  =  B ( g )  =  i  , 

'  T(i) 

iff 

f  =a(i) 

g  . 

o(i)  [<f  ,g>]  =* 

l  ( i ) 

iff 

f  **(i) 

g  , 

(i,0(i))  «  0b(Jf1IIfJ,(c,*)) 

• 

In  addition 

the  constant  relation 

T(ixi)  identified  as 

T(ixi) [<f ,g>]  = 

T(i)  , 

all 

f ,g  . 

III.JT 
(i.e.  ,  R 


is  such  that  (i,T(i))  e  Ob{y 
Suppose  now  (=») 

Then  for  each  ( i  , v )  e  Ob(y 
delta  is  defined  as 


(c  ,a) ) 


hold)  is  valid  for  (C,a) 


III 


I , Th ,11,12 
(C ,* )  )  ,  the  I ntu 1 1 1 on  I st l c  Kronecker 


ey  “  ey[<x(i) ,y(1)>] 

^  (D(y)[x(1)])  v  D(*)[y(i)])  »  r 

noting 

r  ■=  r[<x(1) ,y(i)>]  . 

Some  basic  properties  (see  also  Fourmann  and  Scott  [59]  and 
Scott  [ 226  1  ] )  are : 

For  all  (i,**),  (j,fl)  e  Ob(yj;iI  (c  ,a)  )  : 

(1)  r[<x(1) ,y(i)>]  s  e?[<x(i) ,y(i)>]  . 

Indeed , 

(2)  r[<x(1) ,y(i)>]  =  D[x(1)]  a  D[y(i)]  a  ey [ <x( 1  * , y ( 1 } > ] 

=  (D[x(i)]  v  D[y{i)])  a  ey[<x(1) ,y(i)>]  . 

(Proof.:  Theorem  9,  2.2.2,  VI,  with  a  =  D[x(1']  a  D[y^i)]  and 

b  =  if[<x<i\y<1)>]  ,  yields  S  for  (2)  .  Also,  Rj  Th  ,  where 
r  =  b  and  s  =  a  ,  as  above,  yields  2:  for  (2).) 

(3)  ey[<x(i) ,y(i)>]  =  T  , 


if 
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D[x(i)]  v  Dly(1)]  =  ,y(i)>] 

i  .e.  , 

D[x(i)]  =  D[y(1)]  =  v-[<x(i)  ,y(i)>]  . 

(Proof  is  immed4  te  from  Theorem  9,  2.2.2,  VI.) 

(4)  (i,e?)  e  Ob(J-Ii;[(c,a))  . 

Proof  that  transitivity  holds  for  »<  replaced,  b/  ey 
complicated.  Note  however: 

(5)  Define  for  any  i  «  Ob(C)  and  any  r,s  e  a(i)  , 


r  S»(i)  s 


(or  s  >a(1)  r) 


r  ia(1)  s  ,  but  r  *a{1)  s  . 


Then  we  can  choose  (compatible  with  R  ^  Th  11  and  Rj  Th  12) 


(r  =»  s)  ■ 


T(i)  ,  if  r  *R(1)  s 

8  '  if  r  Sd)  8  ' 


in  which  case  ey  simplifies  to 


%[<x(1,.Y(1,>] 


T  iff  D(y)[x(1)]vD(y)[y(;l)] 

=  r[<x(i) ,y(1>>] 
iff  D (TP)  [x(i)  ]  =  D(y)  [y(1)  ] 


r[<x^KY{i)>]  . 


?[<x(i) ,y(i)>]  ,  otherwise  . 


(In  this  case,  the  proof  of  (4)  becomes  easy,  by  considering 
ey  [  <x* 1  *  ,  y*  i )  >  ]  a  e?[  <y  ^ 1  ^  ,  z* 1  ^  >  ]  and  ey[  <x^ 1  *  , z  ^  1  ^  >  ]  ,  and  con¬ 

sidering  all  possible  cases  when  ey  =  T  or  <  T.) 

Note  the  following  special  cases  for  ( i , TP )  «  0b(y jjj(c >a) ) 

and  the  resulting  ey  : 

(i)  c?[<x(1) ,y(i)>)  ^  D(P)[x(1)]  a  D(r)[y(1)J 


.(i)  „<i> 


T  ,  if  D(P) [x(i) ]  =  D(f) [y(i) ] 

W  f  /v  m  „u>  4  *  n/w\rv  ( i ) , 


ee  [ <x  '  ,y  > ]  =»  ir  [  <x'  '  ,  y  >  ]  ,  if  D(r)[x'  '] 

r  *  D(r)(y{1)]  . 


Equivalently,  we  can  begin  with  any  q  «  a(i)  and  define 
tq  [<x(1)  ,y(i)>]  =  7[x(1))  a  /7(y(i)]  -  yielding 
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(i,e  )  «  Ob(yi;iI(c  ,%) )  and 


m  m  t  iff  /?[x(i)]  =  /?[y(i))  . 

ee  l<*  1  <Y  1  >3  =  e  iff 

Q  *1  /  i  \  m 


/?[x(1)]  *  f?[y(i)  ]  . 


*[<x^  ,y(i)>]  =  T  , 
e,C<X(1),Y(1)>l  -  T  . 


(iii) 


*[<x(i) ,y(i)>]  =  «(i)[<xti) ,y(i)>] 

.  [  T  iff  x{i)  =  y(i) 


i  iff  x 


(i)  „  V(D 


the  ordinary  Kronecker  delta  function  over  i  , 
ey[<x(i) ,y(i)>]  =  a(i)[<x(i),y(i)>]  . 


(6)  For  any  f  €  Ar 


(i,*),  CM)1  III 


(y_TT{C,a))  ,  e  and  e  satisfy 


(H3)  formally,  where  r  is  replaced  by  ey  and  0  by  ee  . 


( Proof ;  Consider,  e.g.,  ef  :  By  (H3)  and  property  (2)  above, 
D(*)[x(1)]  a  Dl'f)[y(1)]  a  ey[<x(i)  ,y(i)>]  a  f  [  <y(  15 ,  z(  3 )  >  ] 
s  f  [<x(1)  ,z(:i)>]  . 

But  since  f[<y(i) ,z(3l>]  S  D{V ) [y( 1 ) ]  ,  the  right  hand  factor 
is  absorbed,  and  since 

D(ir)[x(1)]  a  er[<x(i)  ,y(i)>]  a  f  [  <y{  1  *  ,  z { J }  >  ] 


e?[<x(i)  ,y(1)>]  a  f  [<y(i)  ,z(;5>>]  , 


the  desired  result  obtains.) 


(7)  Noting  that  x  s  x  we  let  g  :  Ob(C)  -  Rel(c,*))  be 

ey  '  ee  Y  ' 9 

arbitrary  fixed  so  that  for  all  i  «  Ob(C )  ,  g(i)  e  a(ixi)  so  that 
(i.g(i))  «  Ob(yjjj(C ,*) )  .  Define  then  the  contravariant  functor 

*  C  -»  Preord,  by,  for  all  i  «  Ob(c )  , 


*g(1)  ~  Ar(i,g{l)).(i,g(i)){ylll(c'a))  u  {eg(i)}  ' 
with  ag(h)  =  »{h)  ,  for  all  h  «  Ar(C)  .  Define 

x  *  u  x  .  Then  (Ob(c),x  )  is  a  category  and 

all  eg(i)'eg(j)  e 

i, j«Ob(c) 
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( ( Ob(c  )  ,3*e)  t*g) )  is  a  deduction  category  with  equivalence 

represented  by  eg  ,  i.e.,  eg  satisfies  Rj  Th  17  -  Rj  Th  21  with 

**  replaced  by  *  =  eg  (slightly  abusing  notation).  We  call  eg 

an  intuitionistic  equivalence,  due  to  the  structure  of  Thv  . 

At  the  formal  language  level,  we  may  consider  distinguished 
function  symbol  (g  :  Ob(Z)  -♦  Rel(z))  representing  "partial 
equivalence"  with  syntax  rule  added:  o(g(i))  =  i  x  i  and  with  * 
temporarily  omitted,  with  only  Rj  Th  lQ  and  Rj  Th  Jg  satisfied 

by  g  ,  replacing  *  ,  and  where  distinguished  function  symbol 

(Exg  :  Ob{i)  Rel(z))  representing  "existence"  or  "extent" 

Exg ( i )  3  D (g ( i ) )  3  g ( i ) [ <x( 1 ] . x( 1 ) > ] 

(where  Id^  ®  <x^i^,y^i^>)  .  In  turn,  define  equivalence 
*  :  Ob(X )  -  RelU)  by 

* ( i )  3  *(i)[<x(i),y(1)>] 

3  (Exg(i)[x(1))  v  Exg(i)[yU)]) 

*  g(i)[<x(i)  ,y(1)>]  . 

The  obvious  compatable  model  ii  il  for  t  v  preserving  these 

8ynK t & 

relations  yields  the  following: 

n«(i)  [<x(1)  ,y(1)>]ii  =  HEx  (i)[x(i)]»  "vii  hex  (i)[y(i)]H 
9  9  3 

=  li=»lt  llg(i)  [<x{1)  ,y(1)>]n 

=  eg(i)([<,,x(1,ll.  lly(i)ii>])  , 

iiExg(i)  [x(1)  ]n  =  ilExg(i)  [x(1)  ])i 

*  HD(g(  i)  )ll  [  <Hx* 1  ^  II  ,  lly^ll>]  , 

*  ii g( i )n  [ <n x(  1  *ii ,  iix(1)ii>]  , 

etc . 

Remarks  2 

Analagous  to  Theorem  3,  (C,s)  can  be  imbedded  within 

''ml'’"  ,Sub))  by  use  of  the  map 

*H(c,a)’  7(c,a)^  :  -*  riu^c'a)  where  for  all  i  €  Ob(c)  , 

V.*!111  i  (i'V 

and,  for  all  f  «  Ari  ,(C)  , 


xoi 
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H(C3i)(f)[<x(i)  ,y<j)>]  3  6<[<fox(1)  ,yU)>] 

and  SuboK(C  is  defined  as,  for  any  i  «  Ob(C)  , 

7 (c ,*)<*>  €  Ara(i),sub(i,ai)(Frenrd)  '  glven  b*'  for  an*  r  €  a(1)  ' 

7{e,»)(1)(r)  =  rtx<1)3  A  «1t<x{i).y(i)>3  • 

etc . 


More  generally,  we  can  replace  for  H . .  ,  the  relation  i-.fi.  , 

i  «  Ob(C)  ,  by  ,  e.g.,  g  ,  as  defined  earlier  in  this  section.  To 
what  extent  any  such  mapping  (whether  for  g  arbitrary  or  for 
g  =  fi_  )  is  a  functor  and  whether  is  a  topos  remains 

open  for  the  general  case.  For  the  particular  case  a  =  H  ,  see 
the  related  results  in  section  2.4  and  the  following  remarks  here. 


Remark  3 


All  of  the  previous  development  specializes  to  well-known 
structures  when  the  base  deduction  category  (c ,*)  is  chosen  to  be 

(SET.H' )  : 

Case  (1)  :  ?  ^  ( SET ,  h’  )  ,  f or  H  a  complete  lattice  such  that  *  is 

formally  treated  as  ordinary  equality  (see  Case  (2)  for  a  similar 
situation)  is  essentially  Gog(H)  ,  Goguen's  category  of  fuzzy  sets 
[96],  obtained  originally  as  the  characterizing  category  satisfying 
six  basic  conditions.  (See  section  2.4.2  D. ) 

Case  (2):  yjjfSET.H')  ,  for  H  a  complete  Heyting  algebra,  except 

for  having  *  formally  used  as  ordinary  equality  in  the  definitions 

of  Ar  ( y  j  j  ( SET , H*  ) )  (see  property  (F3)),  is  the  same  as  Eytan's 

category  Fuz(H)  ,  another  extension  of  fuzzy  set  theory  [56],  [57] . 
When  H  is  not,  in  effect  so  constrained,  but  rather  allowed  to 
have  its  "natural"  full  Heyting  algebra  structure,  including  * 
properly  interpreted  by,  say,  ex  ,  then  the  resulting  category, 

Fuze(H)  is  indeed  a  topos.  (See  2.4.2,  A,  B,  F. ) 

Case  (3):  y^jlSET.H  )  ,  for  H  a  complete  Heyting  algebra,  with 

all  of  its  intuitionistic  structure,  is  always  a  topos  -  Higg's 
topos  Higg(H)  -  and  indeed  is  the  Grothendieck  topos  of  sheaves 
Sheave (H)  [59],  [2261].  (See  2.4.2,  E,  F.) 

Case  (4):  The  relationship  ("representativeness")  mentioned  in 
Eytan  [56],  between  Gog(H)  and  Fuz(H)  is  a  case  of 

Theorem  3,  for  (c,a)  =  (SET,H')  . 

For  further  comments  on  the  above  categories  and  relations 
between  them,  see  section  2.4.2,  (F)  . 
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2 . 5  Probabilistic  and  approximate  reasoning 
2.5.1  General  comments 


It  appears  that  relatively  few  individuals  recognize  that  set 
theory  based  on  probability  logic  (a  multi-valued  logic)  is  the 
basis  of  standard  statistical  approaches  to  problems  involving 
uncertainty.  (See  section  2.3.9.)  With  this  in  mind,  it  should  be 
pointed  out  that  many  semantic  evaluations  exist  for  set  theories 
relative  to  logic,  including  classical  two-valued  logic  (the  deter¬ 
ministic  approach),  intuitionistic  logic,  and  Lukasiewicz  t^-logic 

(section  2.3.4).  In  addition,  why  choose  the  particular  logic  of 
probability  when  there  are  infinitely  many  alternatives,  some  of 
which  were  outlined  in  section  2.3.3?  Why  cannot  we  switch  logics 
for  various  problems  involving  uncertainty?  On  the  other  hand,  it 
has  been  shown  that  quite  different  appearing  set  theories/logics 
may  be  directly  related,  such  as  probability  logic  with  Lukasiewicz 
logic  or  with  a  variation  of  Lukasiewicz  logic.  This  has  been 
accomplished,  e.g.,  through  the  coverage  functions  of  random  sets. 
See  section  2.3.8,  Chapters  5,  6,  and  section  8.3.  Thus  these 
relations  embrace  the  choice  of  probability  logic;  however,  the 
latter  must  be  suitably  modified.  The  difficulty  in  choosing  a 
particular  logic  lies  essentially  in  the  choice  of  logical  operators 
comprising  the  logic.  Since  any  problem  can  be  described  by  an 
appropriate  formal  language,  one  can  use  the  data  to  guide  the 
choice  of  the  most  appropriate  logics.  (See  Chapter  9  or  section 
2.3.5. ) 

In  this  section,  we  outline  an  approach  to  the  manipulation  and 
explanation  of  uncertainty  in  knowledge-based  systems,  employing 
"fuzzy"  logic  as  a  basis  for  approximate  reasoning.  For  further 
background  motivations  and  various  applications  of  fuzzy  logic,  we 
refer  the  reader  to  Zadeh's  work,  e.g.,  Zadeh  [280]. 

We  can  view  fuzzy  logic  as  a  semantic  deduction  procedure  as 
opposed  to  just  deduction. 

Specifically,  let  l  be  a  formal  language,  il  •  li  be  a  fixed 
semantic  evaluation,  and  Loc(t)  a  fixed  logic.  We  assume  at  least 
for  the  present,  no  additional  theory  ThK(z)  .  In  particular, 

consider  =»  (Implication)  in  Loc(z)  . 

Define,  for  any  wff's  $,y  e  wff(t)  ,  the  deduction  sequent 

($  h  r)  or  |  iff  ll =»  yii  =  1  .  Note  that,  for  the  logics 
and  G^.  ,  k  =  2,3,...,*^  ,  $  V-  T  iff  n s  lint  (semantics  entail- 
ment).  (See  section  2.3.3.)  Similarly,  deduction  holds  for 

$r  <P2 .  ....  $m  y-  T  iff  11$ j  =»  m  *  1  ,  j  =  1,2,  ...,m  . 

Note  that  Zadeh's  fuzzy  logic  is  not  typically  formalized  in 
terms  of  a  formal  language  and  semantic  evaluation  framework.  In 
fuzzy  logic,  the  truth  values  -  as  typically  occur  in  multi-valued 
logic  -  in  the  unit  interval  are  replaced  by  linguistic  truth 
quantifiers  such  as  "very  true",  "slightly  false",  etc.,  and  the 
numerical  descriptions  are  in  general  replaced  by  linguistic  ones  - 
i.e.,  linguistic  variables  such  as  age,  with  typical  values:  young, 
very  young,  old,  middle  age,  ...  .  These  linguistic  values  in  turn 

have  numerical  representations  in  the  form  of  fuzzy  set  membership 
or  possibility  functions.  (See  Haack  [102]  and  section  10.2  A(I) 
for  criticisms  of  fuzzy  logic.) 
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At  this  present  stage  of  designing  knowledge-based  systems, 
statistical  reasoning  and  techniques  of  artificial  intelligence 
(Al-based  techniques)  are  the  two  main  approaches  to  the  analysis  of 
uncertainty.  Much  research  remains  to  be  done  in  order  to  see 
whether  or  not  a  combined  approach  of  statistical/probabilistic  and 
other  techniques  (e.g.,  AI  techniques)  will  lead  to  a  better  way  of 
modeling  of  expert  knowledge  systems.  (See  the  recent  paper  of 
Spiegelhalter  and  Knill-Jones  [246].)  As  far  as  meaning  represen¬ 
tation  in  natural  language  is  concerned,  an  attempt  will  be  made  in 
the  following  chapters  to  establish  some  connections  between  possi¬ 
bility  theory  and  random  sets. 


2-5.2  Approximate  reasoning 

Approximate  reasoning  (e.g.,  inexact  reasoning  in  clinical 
decision-support  systems)  is  commonly  used  in  knowledge-based  sys¬ 
tems,  in  particular  in  expert  systems.  In  general,  the  data  base  in 
a  knowledge-based  system  comprises: 

(i)  facts  expressed  by  sentences 

(ii)  rules  expressed  by  conditional  sentences  (where  con¬ 
ditioning  is  expressed  as  a  binary  logical  connective). 

All  sentences  in  a  knowledge-based  system  have  an  associated 
certainty  factor,  i.e.,  numerical  value  lying  in  the  unit  interval 
expressing  the  degree  of  confirmation  associated  with  the  sentence 
in  question  (See,  e.g.,  Shortliffe  and  Buchanan  [236].). 

The  basic  problem  in  the  analysis  of  uncertainty  consists  of 
assigning  an  associated  degree  of  uncertainty  to  conclusions  from 
various  combinations  of  hypotheses  from  (i)  using  (ii). 

Zadeh  argued  that  if  (i)  and  (ii)  are  imprecise,  due,  e.g.,  to 
natural  language,  then  the  computation  of  certainty  factors  is  not 
obtainable  in  classical  form.  But,  if  we  accept  a  weaker  replace¬ 
ment  -  linguistic  certainty  for  numerical  certainty  -  then  fuzzy 
logic  may  be  used  as  a  basis  to  carry  out  approximate  reasoning.  In 
the  latter,  classical  deduction  reasoning  is  carried  out,  modified 
by  the  use  of  imprecise  sentences,  and  thus  hypotheses  and  conclu¬ 
sions  modified  likewise. 

To  illustrate  the  use  of  fuzzy  logic  in  deriving  inference 
rules  for  quantified  propositions,  we  examine  below  an  example  from 
Zadeh  [289]  and  show  that  these  deductions  are  semantically  consis¬ 
tent  . 

Let  X  be  a  finite  set,  and  A  ,  B  ,  C  be  attributes  with 
domains  in  X  .  Let  Qj  ,  Q2  be  linguistic  quantifiers,  e.g., 

"most",  with  domains  in  [0,1]  .  We  write,  e.g.,  <|>A  for  the  mem¬ 
bership  or  possibility  function  of  A  ;  and  Q1  ®  Q2  as  the  product 

of  fuzzy  numbers  ,  an  extension  of  the  product  operation  on  ordinary 
numbers  -  considered  as  singleton  sets  (see,  e.g.,  Dubois  and  Prade 
[51])  . 

Consider  the  following  deduction 

P1  ^  ^1  ^'8  are  ®'s 

P2  ^  ^2  and  ®)'s  are  C's 

P3  “  *Ql  ®  ^2  ^  A's  are  and  C)'s  • 
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To  evaluate  P,  ,  P  ,  P  ,  we  use  the  approach  to  quantification 

1  t  J 

(section  2.3.8)  using  conditioning  with 


<t> 


■  =  l/card(X)  ,  <pr  =  a 

wt  * 


*  =  T  Ibndsum) 

Tor 


Let  u  =  X  (<f >.(x)  a  <$>R(x)  /  2  <t>A(x)  , 


X 


V  =  X  (<j)A(X)  A  <PB(X)  A  4»C(X))  /  Z  («J>A(X)  A  «t»B  (3C)  ) 

X  x 

W  =  X  ( <J>A<  X )  A  $B(X)  A  <t>c(x))  /  x  <|>A(X)  • 

X  X 


Now 


(i)  II P  u  =  $  (W)  3  sup  (t  (s)  A  1  (  t  )  ) 


VQ2 


rs,t 

lst=v 


=  sup  (<j>_  (s)  A  (w/s)) 
s  gl  42 

2:  <(»  (u)  a  <f>_  (v) 

v2 

*  ii  Pj  and  P2 II  . 


(i) 


Next,  if  z  =  x  ($  (x)  a  <J>  (x)  )  /  X  <f»A(x) 


X  X 

the"  S  'V*’  a  V'” 

Is  -  ti z 

i  sup  (4>n  (s)  A  A  (t)  ) 
fs,t  v2 

Is- t=z 

*  VQ2(W)  ' 


(ii) 


since  v  S  X  (<f>  (x)  a  <p  (x)  )  /  X  («|>  (x)  a  (pB ( X )  )  =  ^  and  w  =  uv  . 


x  ■  X 

The  inequality  (i)  means  that 

means  ( P 


(P1  and  P2)  I-  Pg  ,  and  (ii) 


3  i-  wff)  for  any  wff  whose  truth  value  is  ♦ioQ  ®q 


(z) 


See  Zadeh  [288]  -  [290]  and  the  development  of  the  calculus  of 
deduction  sequents,  similar  to  the  case  illustrated  above. 


Symbolization  and  Evaluation  of  Language 


293 


2 . 6  Examples  of  meaning  representation  of  natural  lanqnage 


This  section  through  a  large  number  of  examples,  extends 
Zadeh's  approach  to  modeling  natural  language  [15],  [277],  [279], 
[280],  [283],  [286]  -  [290]. 

The  basic  goal  of  modeling  natural  language  may  be  formulated 
as  the  determination  of  the  "most  appropriate"  mapping  p  :  f  9  , 
where  f  is  the  class  of  all  sentences  (or  open  wff's,  etc.)  of 
interest  and  9  is  the  class  of  all  formal  strings  or  wff's.  This 
is  based  on  the  results  of  the  rest  of  section  2.  (For  convenience, 
we  have  slightly  changed  some  notation.)  See  also  the  Appendix, 
Chapter  10. 


(i)  All  predicates  of  the  form  <x  e  A>  are  in  9  where 
x  e  X  and  A  is  any  fuzzy  subset  of  X  ,  x  ,  A  ,  X  otherwise 
arbitrary 


(ii)  If 

pl . pn 

€  * 

,  then  arbitrary 

(well-defined ) 

combinations  of 

€  *  . 

not  ,  & 

,  or 

operating  on  p^,.. 

.  , p  are  also 

n 

(iii)  If 

pl'p2  «  * 

,  then 

(Pj |P2)  e  9  where 

the  conditional 

operator  (•(••)  is  defined  implicitly  by 


Pj  &P2  =  (PjlPj)  &  P2  j 

®  jci.it  products 

and 


quant(p1|p2)  «  »  , 

for  any  allowable  quantifier  "quant",  such  as:  many,  few,  several, 
all,  sometimes,  etc. 

(iv)  If  p  «  *  ,  then  int(p)  e  9  ,  where  "int"  is  any 
intensifier  (or  extensifier)  such  as  "very",  "little",  etc. 

By  considering  a  large  number  of  sentences  empirically,  the 
above  four  conditions  appear  sufficient  to  characterize  sentences. 
These  cover : 

I  .  Mcdal/mood  operators  -  include: 

(a)  alethic  -  necessity,  possibility  (=  necessary  not), 
probably  likely,  etc. 

(b)  deontic  -  permission,  exclusion  -  may,  allow, 

imperitives/ jussives ; 

(c)  obligative  -  must,  ought  to,  should,  etc.; 

(d)  volitive  -  want,  hope,  expect,  wish; 

(e)  epistemlw  -  belief,  doubt,  knows; 

(f)  threat; 

(g)  potentiality  ,  etc. 
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All  of  the  above  forms  may  be  treated  by  suitable  indexing  in 
conjunction  with  the  compatibility  of  the  (mood)  index  with  the 
particular  type  of  mood  considered.  For  example, 

p(”John  wants  very  much  to  run") 

=  "(Or (John  e  runners^ )  &  (a  s  very ( volition) )) " 


where  the  degree  of  volation  is  measured  on  a  scale  from  0  to  1  . 

(See  also,  e.g.,  Snyder  [245]  for  development  of  modal  logic.) 

II.  Temporal  operators  -  past,  remote  past,  future,  completion  of 
action  vs.  incorapletion  of  action  -  "used  to",  etc.  -  may  be 
similarly  modeled: 

p("John  was  a  good  man") 

=  ''  Or  (John  e  (good  men)  &  t  e  was)", 
tiO  X 

t  being  the  time  index  (domain  R  ,  present  nominally  0)  "was",  a 

fuzzy  subset  of  R  ,  with  membership  function  being  decreasing. 

(See  also  Rescher  and  Urquhart  [215]  or  McDermott  [173']  for  de¬ 
velopment  of  temporal  logics.) 

III.  n-ary  relations  are  the  same  as  n-ary  verbal  forms.  For 
example , 

P ( "John  runs  quickly  to  the  store  for  her") 

=  "(John,  store,  her)  «  run  quickly  -  for" 

In  addition,  as  mentioned  previously,  for  each  choice  of  arbi¬ 
trary  fuzzy  set  system 

’  S  I  ♦not'  V  V 


there  is  a  truth  mapping 

II  •  II  :  9  [0,1]  , 

where  the  following  isomorphic-like  (or  commutative)  forms  hold  for 
any  p  «  9  ,  assuming  truth  functionality  , 


it  (  not  ( p )  )n  *  ^not(llpl1  ) 


M  &  p , ) » 


<P&  ( I'  P"  ) 
jeJ 


"  (Or  p  )!! 
jej  J 


'or 


ii  ( <x«  A> )  n  =  $A(x)  ;  all  x  ,  A  ,  X  ,  etc. 


Hence,  the  composite  mapping 


ii  •  II  op  ; 


f  -* 


[0,1] 
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is  a  truth  evaluat'on  mapping  from  sentences  into  their  correspond¬ 
ing  truth  values. 

Some  additional  comments  concerning  the  modeling  of  natural 
language : 

1.  Many  sentences  have  ambiguous  interpretations  which  can 
often  be  resolved  through  "disambiguating  “  context.  For  example: 

"He  uses  her  well."  (two  meanings) 

"John  found  a  book  on  Third  Avenue."  (three  meanings) 

As  Leech  points  out  ([153],  pp.  78,  79)  probabilistic  (or  for 
that  matter,  possibilistic  -  the  authors)  weights  could  be  assigned 
to  the  various  interpretations,  based  on  usual  understanding  of  the 
sentence.  See  Oden  [196]  for  an  extensive  fuzzy  set  analysis  of 
this  problem  and  an  excellent  survey  of  the  field. 

2.  The  cardinality  of  a  fuzzy  subset  A  of  an  ordinary  space 

X  , 

card(A)  =  2  <f>  ( x ) 

x«X 

has  played  a  large  role  in  symbolizing  quantifications.  (See,  e.g., 
almost  any  of  the  Zadeh  references.)  However,  as  natural  as  the 
concept  appears,  in  general,  it  is  not  representable  as  a  compound 
definition  based  on  a  given  fuzzy  set  system  (^not*  ,  cf>or )  as  is 

the  well-used  ratio 

card (A  OB)  /  card(B) 

*  r  /  2  4>g(X)  , 

x«  X  xe  X 

where  usually  $£  is  chosen  as  min.  (There  is  also  an  obvious 

tie-in  with  the  averaging  operator  and  fuzzy  set  partitioning.  See 
[91],  Theorem  6.5  and  Cor.  6.2.) 

An  alternative  to  the  above  approach,  using  only  the  basic 
fuzzy  set  operators  and  $or  is  given: 

Let  X  be  a  fixed  ordinary  set  and  let 
U  :  X  -  Y  and  v  :  X  ■*  Z 

be  two  mappings,  each  representing  a  linguistic  or  numerical  mea¬ 
surable  such  as  u  =  ht ( •  )  ,  v  =  age  ( •  )  ,  w  =  color  ( •  )  , 

x  =  distance  between  (• ,• )  ,  etc. 

Also  let 

q  :  X  -  [0,1] 

be  a  possibility  function  (or  in  particular,  a  probability  function) 
representing  the  weight  of  importance  attached  to  each  x  e  X  , 
corresponding  to  fuzzy  subset  wt  of  X  .  Let  C  and  D  be  two 
fuzzy  subsets  of  X  . 


Then  defining 
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S,,  S  ;  ;3x)  (x«X  &  u(x)  e  C  &  x  e  wt ) 

l,sc"  =  c<u<x>  >  >  >  * 


SC®D  *  (3x)(x  «  X  &  u(x)  €  C  &  v(x)  «  D  &  X  €  wt)  , 

ll(SC«D)n  =  ^or(<^&(^C(u(x>  *  '  $D(V(X>)'  q(x)))  • 


See  the  previous  analysis  of  conditioning  and  quantification  in 
sections  2.3.9  (B)  and  2.3.8. 

Then  analogous  to  the  development  of  conditional  possibility 
functions,  note  that  it  always  follows  (from  basic  properties  of 
t-norms)  that 

» <sc)H  *  "(sc®d),! 

and  it  is  natural  to  define  sD|c  implicitly  by 
SC®3  =  (SC  &  SD|C)  ' 

whence  (using  the  continuity  property  of  as  in  the  definition 

of  conditional  possibilities) 

"^cW"  =  ,I(SC  &  SD|C>" 

=  4>&(nsci.,  l»SD|cll)  . 

Note  (as  in  the  conditional  possibility  definition) ,  if 
<$>&  =  prod,  then 

"SD|C"  =  "W  /  "V  ' 

and,  if  further,  <t>Qr  =  bndsum  and  q  is  chosen  as  probability 
function,  then 

IIS  II  =  Z  $  (u(x))-q(x)  , 

C  x«X  C 

,ISC®d"  *X  <^c (u(x)  )  •  <J>D(v(x)  )  •  q(x)  , 
xe  X 

yielding  a  result  quite  similar  in  form  to  Zadeh’s  procedure  for 
forming  fuzzy  cardinalities  and  fuzzy  precentages  -  except  for  min 
being  replaced  by  prod  -  in  order  to  model  quantifications.  How¬ 
ever,  note  for  Zadeh's  fuzzy  set  system  ( 1- ( • ) , min, max)  , 


,,SD|Cn  =  "W  =  "SC|D" 
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■  max(min{4,c(u(x>  )  .  <J>n(c(x) )  ,q(x)  )  )  , 
x«X 

which  is  less  close  to  Zadeh's  fuzzy  cardinality  and  fuzzy  quanti¬ 
fication  modeling! 

3.  The  modeling  of  interrogatives  has  been  omitted.  (See  Zadeh 
[290']  for  one  approach  to  the  symbolization  of  questions.) 

4.  It  has  been  proposed  (L.A.  Zadeh,  1983,  personal  communications) 
that  quantifiers  are  essentially  generalizations  of  probability 
measures  and  hence  may  be  further  characterized  through  this 
approach. 

The  following  examples  illustrate  the  mappings  p  ,  li  •  ii  and 
i! -Hop  in  evaluating  a  number  of  different  sentences.  Note  that  in 
most  uses  of  quantifications,  we  employ  Zadeh's  approach  using  fuzzy 
cardinality  [277],  [283],  [286]  -  [290]  -  for  simplicity  -  rather 

than  by  use  of  the  above  more  complicated  (but  closer  to  first  prin¬ 
ciples)  ,  alternative  approach.  However,  one  example  illustrating 
both  approaches  will  first  be  given. 

For  further  discussion  of  the  formal  language  interpretation  of 
natural  language,  see  the  Appendix  at  the  end  of  Chapter  10. 

Note:  In  the  beginning  examples,  we  use  both  p  and  <[>  to 

indicate  possibility/membership  functions. 


Example  1 . 

S^d  nperhaps  there  are  a  hundred  happy  people  in  the  world" . 
Using  Zadeh's  approach: 


iiSjii  =  ♦perhaps ^♦*100^  *  ♦happy^31^^  ' 

where 

♦perhaps  :  ' 

♦«  100  :  no. 130]  -  [0,13  , 


and 


X  -  set  of  all  people  In  world. 
Using  the  alternative  approach  presented  above, 


l,Sl"  “  *or(V*«100<card<B))'*&  (*happy(x),)) 
BSX  xeB 

where  now 

♦*100  :  Set  °f  inteSers  -  [0.1] 


card(B) 


no . 


of  elements  ( i . e .  , 


cardiality)  of 


B  . 


and 
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Example  2. 

S2=  "If  visibility  is  very  good  then  most  likely  a  type  A 

submarine  will  not  be  found  in  area  III  over  the  standard  search 
period . " 

Suppose  (maximum)  visibility  is  defined  in  terms  of  miles  over 
a  possible  continuum  [0,mQ]  ,  where  mQ  is  some  sufficiently  large 

upper  bound.  Then  "at  least  good"  relative  to  this  scale  is  repre¬ 
sented  by  a  monotone  increasing  possibility  function  pQ  and  "at 

least  very  good"  is  represented  by  pVQ  monotone  increasing  with 
PVG  *  PG  - 

This  can  be  accomplished  simply  by  either  use  of  the  transforms 

PVG  ^  ^  =PG^’  +  et)  or  ^VG  ^  ^  PG  ^  ^  ' 

for  constants  a,p  >  0  ,  after  first  modeling  pG  .  In  this  case, 
PVQ  is  considered  as  an  intensification  of  pQ  .  On  the  other  hand, 
PVG  could  be  modeled  directly. 

"Most  likely"  is  a  truth  quantification  and  is  represented  by 
'  C^*l]  [0,1]  ;  ^  ^  *  all  x  €  [0,1]  , 

due  to  loss  of  truth  values. 

Let  X  be  the  set  of  ships  of  interest. 

For  each  time  t  ,  let 

PF  :  X  -  [0,1] 

*111,  t 

represent  the  fuzzy  set  of  submarines  found  in  (fuzzy)  area  III  at 
time  t  .  Also,  let 

PT  :  R+  -  [0,1] 

represent  the  standard  search  time  period  -  symmetric  and  unimodal , 
etc . 

"Will"  is  represented  by 

Pw  :  R+  -  * 

a  monotone  increasing  function,  with  R+  identified  with  time; 
present  time  being  0  . 

Some  fuzzy  set  system  ( <^not '  '  ^or  *  is  chosen>  If  (*)  then 

(•• )  -  implication  -  is  evaluated  through  binary  operator 

:  [0,1]2  -*  [0,1]  ;  $  =»  (u,v)  ^  ^0r  ^not (U^  ,V)  ' 
for  all  u , v  e  [0,1]  . 

PA  :  X  -  [0,1] 
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represents  type  A  submarines. 

Evaluating  the  truth  of  sentence  , 

I1S2»  =  ti  ( [If  (mi(vis)  e  VG)]  then  [ (ML) ( not { ( 9 t ) ( 3x) 

[(t  «  mw)&(x  €  AnFIII  t)]>)])n 

=  <}>  (H(mi(vis)  «  VQ ) ll  ,  (ML)  (not)  Or  Or 
*  t€R  X€  X 

II  (t  e  TnW)&(x  e  AHFi;i;i  t)ll) 

=  ^  (pVG(mi(vis) )  , 

PML(<*not  ^or(  (^or(^&(PT(t) 'Pw(t> 'PA(X)  ,PIII,t(x^  1  )  }  }  1 
t«R+  x«X 


Example  3. 

Sg  =  "Based  on  very  reliable  sources.  Ship  B  will  soon  arrive 

in  area  II  along  with  most  of  their  heavily  equipped  carriers  that 
are  within  about  100  miles  of  B  at  present". 

"Based  on  very  reliable  sources"  is  a  modification  of  truth 
increasing  curve  pR  ,  pR  :  [0,1]  -♦  [0,1]  ,  where  compared  to  pML 

in  Example  2,  we  have  for  all  x  e  [0,1] 

X  *  PR(X)  *  pML(x)  . 

This  could  be  modeled  directly  or,  analogous  to  the  situation  for 
PVG  vs  pG  ,  by  some  (perhaps  exponential  or  translational)  relation 

with  pML  or  some  common  base  such  as  the  identity  function 

id  :  [0,1]  -«  [0,1]  ,  id(x)  s  x  ,  all  x  «  [0,1]  ,  identified  as  the 
truth  level  function. 

Similarly  "will  soon"  is  a  modification  of  "will"  and  thus  we 
may  have 

1+n 

PWS  =  pW 
or 

pwe  =  pw(-  +  6)  * 

for  some  suitably  chosen  i ,0  >  0  . 

"at  present,  within  about  100  miles  of"  is  a  symmetric  fuzzy 
relation 


pQ  :  Dx  D  -*  [0,1]  , 

where  D  is  some  suitably  large  region  of  the  earth's  surface  mea¬ 
sured  in  latitude  ,  and  perhaps  the  shape  of  p„  could  be 

W  f  J  vU 

1  ,  if  Iix-yii  s  100 
1  _  ^-a-nx-yii2 


PQ(x,y) 


if  iix-yii  >  100 
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for  some  suitably  chosen  constant  a  >  0  . 

Thus ,  i f 

C(B)  =  set  of  all  heavily  equipped  carriers  within  about  100  miles 
of  B  at  present  that  will  arrive  soon, 

then  letting 

X  ^  set  of  enemy  ships  of  interest  , 
and  for  each  time  t  , 

ArTT  =  fuzzy  set  of  all  enemy  ships  arriving 

X  J.  ,  TI 

in  area  II  at  time  t  , 

and 

H  =  fuzzy  subset  of  X  representing  heavily  equipped  carriers, 
then  for  all  x  «  X  , 

<?>C(B)  (x)  =  11  (x  €  C(B)" 

=  it  (at)  (  (teWS)  &  (x  e  Arjj  t)  &  (x  e  H)  &  ((x,B)  «  Q))li 
=  V  (VPWS(t)'PArTT  <x).Ph(x),Pq(x.B)))  • 

t€R  I-t,t 

Then  the  (fuzzy)  cardinality  of  C(B)  is 

card(C(B)  )  =  X  <»>  (x)  . 

x«X  C(B) 

Also,  if 

E(B)  ^  set  of  all  heavily  equipped  carriers  within 
100  miles  of  B  at  present, 

then  similarly  for  any  x  e  X 

PE(B)(X)  =  VpH<x)'pQ(x'B,) 

and  we  can  show,  using  the  properties  of  <p&  and  <f>or  that  -  as  a 
check,  as  should  be  - 

PE(B)(X)  *  PC(B)(X)  ;  311  X  6 

1 .  e  .  , 

E(B)  2  C ( B )  , 

(one  version  of  fuzzy  subset  relations). 

card(E(B) )  =  z  PE(B) (x)  . 

X€  X 

Now,  "most"  may  be  considered  to  be  represented  by 

PM  :  [0,1]  -  [0,1]  ;  PM(x)  =  0  ,  for  0  S  x  <  ^  , 

and  over  [-i,l]  ,  pM  is  monotone  increasing  to  1  . 


X  , 


As  before. 
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Then  "most  of  their  heavily  equipped  carriers  that  are  within 
about  100  mile  of  B  at  present  will  soon  arrive  in  area  II"  becomes 
$M(card(C(B)/card(E(B) ) )  . 

Thus,  sentence  3  has  truth  evaluation 

11  S 3 '•  “  Pr(^&(^C(B)  (B)  ,<f|M{card{C(B)  )/card(E(B)  )  )  )  )  . 


Example  4. 


S4  =  "Intelligence  reports  that  during  the  past  month,  between  eight 

and  ten  of  the  largest  ships  spotted  in  region  I  were  observed 
to  be  carrying  a  total  of  perhaps  a  dozen  missiles,  some  of 
which  may  be  of  class  G;  this  could  indeed  lead  to  a  world 
crisis . " 


First  let  sentence  S4  be  broken  up  to  the  left  and  right  of  the 

semi-colon  as  (K;L)  .  Clearly, 

11 S4"  =  ^&(1,KI1  • 11 Ln  )  • 

Let 

X  3  the  ordinary  set  of  all  ships  of  interest  and  for  each  t  e  R  , 
S  =  the  fuzzy  set  of  all  ships  of  interest  spotted  in  region 

i  t  v 

I  at  time  t  ,  For  any  X  «  X  ,  suppose 

pT(x)  =  degree  to  which  x  is  large  , 

ii 

j 

f(x)  =  no.  of  missiles  x  is  observed  to  carry  , 

and 

g(x)  s  f(x)  ;  g(x)  =  no.  of  G-type  missiles  x  may  carry, 
f  is  known,  while  g  is  unknown. 

Then  for  any  integer  m  and  e  X  , 

Pp„  =  «'  - 


represents  the  fuzzy  time  period  "past  month", 

i 

I 


ps  <xi . V 


il  (x, . x  were  spotted  in  I  during  the  past  month))! 

1  m 


=  ♦or  (^(pPM(t)  '  pST  (*!>'•• 
t€R  1't 


I  ,t 


with  corresponding  ordering  by  size  indicated  by 

Pl(x(1)>  *  Pl(x{2))  *  •••  *  PL(x(m)5  • 

Let 

PF  :  {6,7,8,9,10,11,12}  -  [0,1] 


be  the  possibility  function  (peaking  to  one  jt  8,9,10)  representing 
"between  eight  and  ten". 

Similarly , 
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P»12  :  -  [0,1] 

represents  "perhaps  a  dozen"  ,  "Some"  is  represented  by 
pSom  .  [O.D  -  [0.1]  . 

generally  unimodal  and  peaking  to  one  prior  to  1/2  .  Note  that 

"intelligence  reports  that1'  acts  only  as  a  truth  modification  on  the 
rest  of  K  ,  and  may  be  represented  by  some  suitably  chosen 

pint  :  [0'1]  **  C°'1]  ' 


which  may  be  compared  to  the  truth  modifications  (or  intensifi¬ 
cations)  in  sentences  S2and  Sg. 

Thus , 


ii Kit  =  ii  (int^axj)- •  (3xb)  f{xJ( . . . ,xm)  s  X 

&{x, ,...,x  spotted  in  I  during  past  month) 

1  IQ 

n 

&  (3n)  ( 1  SnSm&(n€FISt(Z  f(x...)  e  12))  & 

i  =  l  m 

n  n  . 

((  z  g(x(1j)/  z  f(x(ij))  e  Som))))» 


pint  ^  ^or 


over  all 
(X., • • « rX  ) £X , 
mil 


^&tPSl<xl' 


'V 


*or(Wn)'  P-12<  V(X(*))'  PSo.<  *  «<*(i))>V1f(*(i))),n 

ISnSm  i=1  1-1 


)  ) 


Next,  note  that  "could  indeed"  is  a  modification  of  truth  which 
if  compared  with  "will"  and  "will  soon"  looks  like: 


Present  t  -♦ 


Let  "world  crisis"  be  represented  by 
pwc  :  Y  -  [0,1] 

where  Y  represents  the  states  of  the  world  which  for  simplicity 
could  be  n  by  1  vectors,  each  entry  being  some  measure  of  po- 
tentitial  crisis  such  as  "troop  buildup  in  area  IV",  "defense 
expenditures  over  time  "trade-level",  etc.  Then  for  any 

y  «  Y  , 


IMyjll  =  PCou(VW’  puc(yl)) 
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Example  5. 

S,.  »  "For  any  0  <  a  <  1  ,  let  be  defined  by  F2(Ca)  =  1  -  a  , 

where  F2  is  the  cumulative  probability  distribution  function  of 
2 

*2  (a  chi-square  random  variable  with  two  degrees  of  freedom).  Then 

decide  Hq  with  probability  t  and  Hj  with  probability  1-7 

iff  A  ( z )  <  C  for  test  statistic  A(Z)  ,  assuming  A(Z)  given 
a 

2 

Hq  is  distributed  as  x2  ;  otherwise  decide  Hj  with  probability 
one . " 

The  above  sentence,  a  combination  of  mathematical  and  logical 
terms  may  be  evaluated  a  few  different  ways:  Let 

P.<  .  P=  ••  *  *  *  -  [0,1] 


be  the  possibility  functions  representing  the  binary  relations  of 
(either  fuzzy  or  crisp)  inequality  and  equality  of  real  numbers. 

For  any  a(z)  z.  0  ,  let 


pd | a  (z)  :  }  ■*  C°' 15 

be  the  possibility  (in  actuality,  probability)  function  correspond¬ 
ing  to  the  fuzzy  subset  "decision  given  a(z)  observed". 

Then  one  interpretation  is: 

11  s«j"  =  <P&  ($&(P=  [  ,  F  2  ^  ( 1  -a )  }  ^  ^  ' 

0<a<l 

Qa  =  <f&  (1>&(Qla(A(z)),Q2a(A(z)))) 

(A (Z)i0) 

where 

Cia(A(z))  -  *^(p<(A(z),Ca),1)&(p=(pd|A(z)(Ho),7)  , 

P=(Pd|A(2)<Hl>'1“1')))  ' 

Q2a(A(2)>  “  ^«=,(1-p<(A<2)'Ca)-^(p=(pd|A(2)<Ho)'0)  ' 

P={pd|A(z)(Hl)'1,))  • 


Alternatively,  if  p<  ,  p_  strictly  represent  classical 
inequality  and  equality,  respectively. 


US, ii 
o 


il(Sa(a,A(z)))ll  , 

0  S  a  S  1  , 
A  (  Z  )  2:  0  , 


304 


Goodman  and  Nguyen 


where  for  any  0£a£l,A(z)£0, 
ii  { Sg (a , A  (z)  )  )ii  =1 
iff  a  and  A(z)  are  such  that 

C  -  F^d  -  a)  and 

a  2 

If  A ( z )  >  C  ,  then  f  pd|A(z)^H0^  =  7 

I  Pd|A(z)(Hl'  1  ~  7 
if  A(z)  s  C  ,  then  f  PdU(z)(H0)  0  ' 

“  1  pd|A(z,^Hl>  =1- 

II  (a  , A  ( z )  ) II  =  0  ;  otherwise. 


Example  6. 

S-  *  "The  probability  that  the  target  of  interest  contains  on-board 

D 

many  more  systems  of  the  form  A  than  that  of  B  is  not 
really  very  high." 

Let  the  number  of  systems  on-board  the  target  of  forms  A  and 
B  be  represented  by  the  joint  probability  (and  possibility) 
function 


PA, B  :  Jo  x  Jo  **  [0'1J  ' 

where  JQ  is  the  set  of  all  nonnegative  integers. 

Let  "many  more"  correspond  to 

PM,M  :  Jo  x  Jo  t0,1]  ' 

where,  e.g. ,  pM  M(x,x)  =  0  ,  but  Pm(m(x'y*  iS 
nondecreasing  in  y  ,  etc.. 

If  "high"  is  first  modeled  as  a  truth  modifier  with  possibility 
function 


etc . 


PH  :  [0,1]  -♦  [0,1]  ,  pH  nondecreasing, 

then  one  way  to  model  "not  really  very  high"  is  by 

d  ,  1+a  .  .  . 

PN  =  1  -  PM  ;  constant  a  >  0  . 

Then, 


iiS-ii 

o 


If 


=  H  (NK  (3x)  (3y)  ( (X,y)  €  ( A ,  B )  &  (x,y)  e  MM)))II 

=  PNtf  ^or  (VPA,B(X'Y)'  PMM(x'y))))  • 
x,y*Jo 


* 


or 


that 


prod  and 


bndsum  is  chosen,  then  it  follows 
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"V  "  pN<VPm,m(X))>  ' 

where  X  is  a  random  variable  over  J  x  J  corresponding  to  p,  „ 

o  o  A ,  B 

and  Ex( •)  is  statistical  expectation. 

The  following  additional  examples  illustrate  further  the  use¬ 
fulness  of  general  fuzzy  set  systems  in  modeling  and  interpreting 
unertainties  in  decision  and  estimation  problems  where  evidence  may 
consist  of  a  mixture  of  probabilistic  and  natural  language  infor¬ 
mation,  or  only  the  latter.  Abbreviations  for  fuzzy  sets  or 
attributes  will  be  self  explanatory  as  much  as  possible. 


Example  7. 


ii  (Men  are  better  operators  than  women.  )ii 
=  X  (lev,  op(x)  ,  lev.  op(y)  ) /card(Men)  •  card(Women) 

x«  Men , 

•  ye Women 

or,  alternatively, 

=  (  X  lev.  op(x) /card(Men) ,  X  lev.  op(y) /card (women) ) 

xeMen  ye Women 

where 

:  R  x  r  ^  [0,1] 

is  a  fuzzification  of  ordinary  inequality  or  equality. 


Example  8. 


il  (Most  men  are  better  operators  than  most  women.  )li 

fcard(x|x  e  Men  &  lev.  op(x)  is  better  than  most  women. 


" ^  ^Most [ 
where 


card (Men) 


]" 


4> 


Most 


with  <|> 


[0,1]  -  [0.1]  ;  <p(x)  a  0  ,  for 
Most  monotone  increasing  up  to 


card(x  |  x  «  Men  &  lev.  op(x) 

*  *  <p  (x) 

xeMen  fis  better  than! 

[most  women  J 


x  S  1/2  , 

1  after  that; 
is  better  than  most  women) 


<t>  (x) 

is  better 
than  most 
women 


^Most 


fcard(y|y  e  Women  &  ^(lev.  op(y),lev. 
card(Women) 


op  IX)  j ) 


’ 

t 


card((y  |  y  «  Women  &  <p^(lev.  op(x),  lev.  op(x) ) ) ) 

«•  X  ( lev.  op(y)  ,  lev,  op(x)  )  . 

y« Women 
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Example  9. 

H (The  identities  of  A  and  B  are  believed  to  be  incorrect; 
most  likely  they  should  be  reversed.)11 

"  ^(^believe^&^not(^(identtrue(A)’identpast  est.(A)))  ' 

*not<*-  (ldenttrue<B)'identpast  est.<B>>>>>  ' 


*most  (V*=(ldenttrue{A)  , ident 
likely 


past  est. 


<B )  )  . 


where 


*=<lden^rue<B>'identpast  est.(A)))) 
identtrue'  ldentpast  est  :  Ships  ln3-CtlV-- »  Names 


Example  10. 


li(I  intend  to  launch  system  C 
help  F )  H 

=li  (My  intended  launching  of  C 
F )  H 


against  D  and  E  in  order  to 
against  D  and  E  is  to  help 


^or  ^because  ^launch  (**C»D»E) 

(0st,t‘)  against,  t 


^help, t  *  ■ 1 ' ' ' ^intend  ^ t  ^ 


Example  11. 


H (Country  A's  ships  are  now  widely  distributed  with  no  unusual 
activities . ) n 

"  ^or(<)>&(<},high(scatter(shlps(A)))^now(t)  • 
t«R 

<*>usual(actlv'  ^eve^ (ships (A) ) ) )  , 

where 


scatter (Ships(A) ) 
etc . 


=  z  dial ( loc(x) , loc(y) )/( card  Ships(A))2  , 

x,y«ships(A) 


Example  12. 


ii  (It  always  rains  in  Minnesota,  according  to 
=  ii  (Very  often  it  rains  in  Minnesota  according  to 


* 


v, of ten 


t«D 


4> 


rain,  t 


(Minn. ) /card  D) 


Sam 


where 


3Sam  is  some  sufficiently  large  set 
according  to  Sam. 


of 


Sam )  ii 
Sam)  li 


days  of  recording 


Example  13. 

it  (It  sometimes  rains  here  at  night  a  couple  of  days  later 
following  an  unusually  hot  day)n 
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=  $some  (Z(<*or  ! 'V  ^usually ( ^hot , day ( t ) (here) } ' 
t€D  fit  high 
f  «D 

*rain,night(t)(here)  '  Couple  of  ( t . t • ) ) ) ) /card(D) )  . 

days  between 

where  D  is  some  sufficiently  large  set  of  integers  representing 
days  of  weather  reporting. 


Example  14. 

ii  (Green  flares  were  sighted  shortly  after  the  initial  contact 
which  used  equipment  A )  il 

=  «or  <<«,&(Veen(color(x)'<f,sight,t(sYStem'x)'  *was(t>' 
xeFlare 

■Wrtly(<t,s(t,'tn'  ‘*W.t,(SYStem'A)  '♦initial  * 

after  contact 


Example  15. 

it  (Contact  with  A  was  held  for  a  long  time)n 

=  »or  <V*contact,t(sYstem'A>'V  ,{t>  '*was(t,))  ' 
t.t'iO 

where  Ct,  is  the  fuzzy  interval  representing  "for  a  long  time 
beginning  at  about  f " .  Thus 

*C  .(t)  =  *or  (^(^(f't),  ^(t.f  +  t»)  .*long(t">)>  . 

*  f  ,  tSO ,  time 

t'+t"S0 


Example  16. 

ii  (A  is  continuing  his  search  of  targets  in  B  despite  heavy  losses 
to  his  vessel)  ii 

=  4>  (<pp(<l>  w  *  . (A,x) ,$n(loe(x) ) ,  *  (t)  , 

Tor  Tsearch  of,t  TB'  Tnow 

xeTargets  continue 

*heavy(loss(A))))  * 


Example  17. 

til  tracked  the  ship  for  (about)  2  hours  and  (then)  lost 
contact )  ii 


308 


Goodman  and  Hguyen 


=  ♦or  (^^track.t(I'ship>'<»>At.(t)^not(nrack,f'(1-ship))  ’ 

where  A^,  is  the  fuzzy  time  interval  from  fuzzy  time  around  t* 
to  fuzzy  time  two  hours  later  (around  t'  +  2)  ,  and  thus  for  any  t 

4>a  (t)  -  VVt1  *t)  ,^(t,f  +  2))  . 

4>  •  R  x  R  -»  [0,1]  is  a  fuzzification  of  ordinary 


equality. 


B^,  is  the  fuzzy  time  interval  from  around  time 


t‘  +  2  +  t"  until  0  (present).  Thus,  for  any  t 

«PB  (t)  =  <f>&(^s(t'  +  2  +  t",t)  ,  ^(t.0>)  . 

t 1  ,  t 

Example  18. 

ii  (it  is  possible  to  stop  action  C)n 

=  ♦or  (^(Vss(a)^stop;a,t<system'C)^will(t))) 

0 SaS 1  , 
tkO 


where 


♦poss^a^  ♦not ^ ♦necess ^ 1 


Example  19. 


ii  (Most  likely,  action  C  can  be  stopped)" 

♦or  {V*most  '♦poss(a))  Atop;a,t(system'C)^will(t>>> 
OSctSl,  likely 
t2  0 


Example  20. 


ii  (It  Is  impossible  to  stop  action  C)» 

=  ♦or  (V$not(Vss(a))'^stop,a,t(system  C)  '  ♦will  < t  >  >  > 

OS  as  1 , 
tzo 


Example  21. 


ii  (A  employed  system  B  but  was  unable  to  follow  the  target)1 

=  ♦or^^use.t(A"B)^was(t)^not^follow,t(A’tar9et)>^ 
tSO 
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Example  22. 


II  (It  was  stated  yesterday  that  "because  of  several  factors,  A 
will  not  be  able  to  refuel  at  B")n 


^or  ^yest  ^  '  ' ^because  ^var  fact ,  t  ^ ' ^not  ^refuel 

*will(t'  -  t)))  . 

where 

^var  fact,t(A)  =  Several*  /  ♦fact  j  ,  t  ( A) /card(  J)  ) 

and 

J  =  set  of  all  relevant  factors. 


( A ,  B )  )  ) 
at ,  t 


Example  23. 

ii  (It  may  be  necessary  to  stop  action  C)n 

*=  <t)  ( cb  _  ( cb  [<p  (a)),<b  .  *.  (  system  ,  C  ),  i>  <1n(t)))  ; 

0  «aS  1 
tiO 

it  (It  will  be  necessary  to  stop  action  C ) » 

"♦or  (^(^necess(a)'<f,stop;a,t(sYSt:em'C)^will(t))>  ' 
OSarSl 

tso 


Example  24, 

ii  (We  were  going  to  do  this  but  instead  because  cf  condition  A 
we  did  that) H 

=  ^or  (V*do,t(we'thls)'*were  (t)' 
tSO,  going, to 

t  'St"S0 

*because(Vt'(we>’^o,t-’(we'that))  ' 

<Wt')  '  ♦was(t")))  ' 

where  for  all  u,v  «  [0,1]  , 

♦because (u'v)  S  < *G (u ' v) ' u > 

Diagramatically ,  we  have: 
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Example  25. 

Law  of  excluded  middle  need  not  hold  here: 


ii  (On  the  whole,  some  small  countries  are  both  friendly  and  not 
so  friendly  to  strangers )n 


A  (A  i . 

Ton  wh  Tsomel 


where 


fcardfsmall  coun.  n  fr.  n  not  so 
card (small  coun.) 


fr .  ) 


card(small  coun.  n  fr.  n  not  so  fr.) 


z 

xe countries 


V*small(poPul(x,)'*fr  .nnot  so  fr .  ^x^  ' 


^fr.nnot  so  fr. 


(x) 


,  *  „  ^a^fr<x'Y)^not  so<*fr 

(yeVlst .  to  x) _ _ _ 

card(Vist.to  x) 

card (small  coun.)  =  Z 

x« countries 


<*,y>)> 


^smal 1 ^  popu* ( x ) )  • 


Example  26. 


ii  (John  is  tall,  yet  not  that  tall)ii 
=  V*tall(ht(John))'*not  so<*tall(ht(John))))  • 


Example  27. 

Redundancies  are  employed  here: 


it  (He  is  a  large  man;  Indeed  he  is !  ) ii 

=  C^large (ht(he)  'wt(he)  )  '^large(ht(he)  'wt(he> )  ) 
man  man 


Example  28. 


ii  (A  little  more  than  one  third  of  the  ships  observed  were  from 
country  B)ii 

^or^&^little  mores  1/3 


( 


xc  Shins 


V*ob8e4?*'Vx)>/ 


xeshins 


^observ , t  ^ x  ^ ' ^was 
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Example  34. 


where 


li  (All  of  my  goats  eat  about  three  times  a  day)N 

-  <e&  (x),  <$>  (x))j  , 

x«Goat  Goat (Me)  eat  about  three 

J  “  times  a  day 


^eat  about  three ^ 
times  a  day 

=  *or 


(4>(x) 


M>  (n)))))  , 


(n.i.2..)(«t1<v...<Vt0m-i,...»)  "°“nd 


and  for  each  i  , 

<)>  (x)  =  4> 

eat  around  _ 

time  t,  u 


OS  tit 


( x)  ,  4>  (t)))  ; 

eat,t  around  t. 


t  =  end  of  day  in  hours,  e.g. 


Example  35. 


H (All  of  my  goats  eat  very  frequently  each  day)n 

is  the  same  as  the  above  example,  except  for  the  replacement  of 

^about  (n)  by  *very  lh)  '  tor  a11  n  "  1 ' 2  •  •  •  • 

three  frequently 

Example  36. 

H (A  fat  man  who  hates  himself  was  talking  to  a  thin  one . ) n 
=  V(lf,or  (V*fat(wt(x),ht{x)},  *  (x,x)  ,  <J>was  ( t )  ,  <f»  (x,y)  , 

tiO  x,y«Men  hate,t  talk.t 


talk, t 


,t»thin(wt(Y)  'ht(Y) )  )  > ) 


Example  37. 


li  (John  hasn't  been  here  for  a  month,  but  Bill  has.)u 
“  <f&(<l>or^&^  (John, here) ,$  (t),<j>  (t)))  , 

is,t  <  past  now 

month 

I'or  (Bill, here)  ,4>  (t)  <f  ( t )  )  )  )  . 

tSO  is,t  ipast  ,  now 

month 


Example  38. 


il  (Larry  and  Luke  punch  each  other,  but  Pat  and  Mike  don't)n 
=  ^(l*  ( Larry, Luke )  ,<j>  (Luke, Larry)  , 

punch  punch 

^not ( ^  (Pat, Mike), <)>  (4>  (Mike.Pat)))  . 

punch  punch 
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Example  39. 

ii  (Bill  asks  John  about  himself  (John))ll 
=  <}»  (Bill ,  John,  John)  . 

ask  about 


Example  40. 

ii  (Bill  asks  John  about  himself  (Bill)  )n 
=  $  (Bill, John, Bill)  . 

ask  about 


Example  41. 


ii  (John  is  being  easy  to  please )H 

=  for  <Vfeasy  to  < *»*»)  *f  <*>>> 
t*R  please, t  now 


Example  42. 

H (John's  intelligence,  which  is  his  most  remarkable  quality, 
exceeds  his  foresight .)  ii 

*  $& ( ^ ( meas ( Intel 1( John) )  ,  meas ( forsight ( John) ))  , 

<}>_(max  meas(x)  ,  meas ( int ell  ( John) ))  ) 
xcQual ( John) 


Example  43. 


ii  (What  Thompson  said,  which  Smith  criticized  him  for,  is  that 
we  should  visit  Bolivia) ll 

11  f&(f0r(f&(f  ( Thompson, x)  ,  $  (t)))  , 

tso  say't 


was 


and 


for(f&(f  ( Smith, Thompson, x) ,$  ( t ) ) ) )  , 

.<n  criticize  was 

for,  t 

x  ^  "we  should  visit  Bolivia"  , 
o 

<t>  :  (People)x  (statements  made  at  t)  -*  [0,1}  , 

say,  t 

"xo"  “for  <f&(fvisit  a(we'Bolivla) 'f  («>>)  • 

osasi  '  should 


Remark . 

As  a  final  note  to  this  section,  complete  truth  functionality 
has  been  assumed  for  ll  •  ll  of  the  logics  involved.  Alternatively, 
semi-truth  functional  logics  could  have  been  utilized,  such  as 
probability  logic.  (See  section  2.3.9). 
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CHAPTER  3 


UNCERTAINTY  MEASURES 


This  chapter  is  devoted  to  the  study  of  different  measures  of 
uncertainty.  The  major  application  of  this  -  as  will  be  seen  later 
-  is  to  knowledge-based  systems.  (See  Chapters  8  and  9.) 

By  an  uncertainty  measure  for  a  given  system,  we  mean  a  numeri¬ 
cal  valued  function,  usually  in  the  form  of  a  membership  function  or 
dispersion,  which  reflects  the  degrees  of  certainty  or  uncertainty 
of  various  objects  of  interest  or  events  from  the  system. 

In  the  spirit  of  Chapter  2,  uncertainty  measures,  alternative¬ 
ly,  may  be  thought  of  as  special  cases  of  dispersions  and  may  be 
directly  connected  with  formal  language  and  semantic  evaluations: 
They  are  semantic  evaluations  of  certain  relations,  whose  signa¬ 
tures,  when  evaluated,  become  the  universes  of  discourse  or 
semantical  domains  of  the  uncertainty  measures  at  hand.  For 
example,  probability  measures  and  related  measures  and  their  gen¬ 
eralizations  Including  Dempster-Shaf er  or  Choquet  capacity-type 
measures  all  may  be  considered  in  this  light.  See  sections  2.3.3 
(C)(4)  and  2.3.9. 

Typically,  for  a  probability  space,  uncertainty  is  measured  by 
V  :  a  -•  [0,1]  where  x  £  *{X)  for  some  base  space  X  ,  with  x 

being  also  a  o-algebra  and  p  satisfying  the  standard  normalized 

measure  axioms  (as  formulated  by  Kolmogorov,  e.g.).  The  higher  the 
value  //(A)  for  A  «  x  the  more  certain  or  "probable"  A  is.  By 
weakening  (or  omitting  altogether)  the  structures  on  x  and/or  m 
above,  more  generalized  concepts  of  uncertainty  measures  are  ob¬ 
tained  such  as  the  Choquet-type  measures  or  possibility  measures, 
among  others.  (See  Dubois  and  Prade  [51]  for  a  summary  of  axiom 
inclusion  relations  between  various  proposed  measures  of  uncertain¬ 
ty.)  On  the  other  hand,  many  kinds  of  uncertainty  measures  may  be 
derived  from  probability  measures  by  use  of  suitable  transformations 
such  as  the  logarithmic  one  yielding  statistical  information  mea¬ 
sures  and  in  turn  (through  the  expectation)  statistical  entropy. 

In  summary,  uncertainty  measures  may  be  considered  essentially 
as  structured  dispersions,  often  unnormalized,  as  in  the  case  of 
information  measures,  but  usually  normalized  or  bounded  as  in  proba¬ 
bility  measures,  possibility  measures,  or  Choquet-type  measures. 


3.1  Probability  theories  and  information  theory. 

The  most  common  approach  to  uncertainty  modeling  is  based  upon 
some  form  of  probability  theory.  For  an  excellent  treatise  on  var¬ 
ious  theories  of  probability,  including  Kolmogorov-axiomatic,  Von 
Mises'  frequentist  and  Savage's  subjective  interpretations,  we  refer 
the  reader  to  Fine  [58].  For  logical  and  philosophical  issues  en¬ 
tailed  in  the  application  of  probability  theory  and  Its  derivative 
field,  mathematical  statistics,  to  uncertainty  modeling,  see,  e.g., 
the  excellent  papers  of  Good  [76],  Savage  [221],  De  Finetti  [46*], 
Llndley  [ 158  '  ]  . 

From  probability  theory,  the  logical  path  to  statistical 
Information  theory  was  established  by  Shannon  [232].  Obviously, 


315 


316 


Goodman  and  Nguyen 


general  information  theory  plays  an  important  role  in  the  theory  of 
knowledge.  In  statistical  information  theory,  the  measure  of  infor¬ 
mation  supplied  by  probabilistic  experiments  is  connected  to  the 
amount  of  uncertainly  Specifically,  the  information  measure  is 
defined  as  follows: 

Let  (n,.*,P)  be  a  probability  space.  Define  J  :  a  -*  R+  by; 

J ( A)  =  -log  P  { A)  ;  all  A  «  a  . 

General  information  theory  (Kampe'  de  Feriet  and  Forte  [132]  r 
see  also  Guiasu  [99])  is  based  essentially  on  the  idea  that  it  is 
possible  to  define  information  measures  by  a  set  of  axioms.  More 
precisely,  if  a  is  a  class  of  subsets  of  a  ,  then  a  mapping 

J  :  *  -*  R+  is  called  an  information  measure  if: 

(i)  A,B  e  a  and  A  £  B  =>  J(A)  i  J(B)  . 

(ii)  If  0,n  «  a  ,  then  J(0)  =  ■*»  ,  J(h)  =  0  . 

Several  information  measures  are  defined  with  additional  struc¬ 
tures.  E.g. ,  when  a  is  an  algebra  of  sets,  J  is  said  to  possess 
a  regular  composition  operation  F  if 

J (A  U  B)  =  F( J (A) ,  J(B) )  ;  all  A,B  e  a  , 
where  U  denotes  disjoint  union  and 


(a) 

F  : 

R  +  X 

R+  -*  R+ ,  continuous  , 

(b) 

F(x, 

Y)  - 

F(y,x) ,  vx,y  e  r+  , 

(c) 

F[x, 

F(y, 

z)]  »  F[F(x,y),z]  ,  (associativity) 

<d) 

F(x, 

+-) 

= 

(e) 

*1  < 

X2 

=»  F(x1,y)  S  F(Xg ,y)  ,  for  all  y  e  R+ 

Note  that  such  an  F  induces  a  topological  semi-group  on  R+  (see, 
e.g.,  Paalman  de  Miranda  [201]).  For  example,  for  the  Wiener- 
Shannon  information  measure,  we  have 


-ii  _Y 

F(x,y)  =  max  (0,  -c  log(e  c  +  e  c))  ,  c  >  0  . 

If  J ( A)  =  l//i ( A)  where  Is  a  measure  on  a  measurable  space 
(n.-A)  ,  then  F(x,y)  =  (^  +  y)  1  (hyperbolic  semi-group).  (See 
also  [136].) 

An  important  class  of  information  measures  related  to  possibil¬ 
ity  theory  (Zadeh  [218])  and  Choquet's  capacity  (Choquet  [36])  is 
the  class  of  information  measures  of  type  Inf  (see,  e.g.,  Nguyen 
[185],  Langrand  and  Nguyen  [151])  defined  as  follows: 
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Let  r  :  J1  -*  R+  such  that 
inf(r(w)  |  «  «  n}  =  0 

Define  J  :  »{ft)  -*  R+  by: 

J(A)  3  inf{i»(u)  |  «  A}  . 

The  regular  composition  operation  of  the  above  J  is 

F(x,y)  =  x  a  y  ;  all  x  ,  y  . 

For  example,  (see  Kampe  de  Fe'riet  and  Nguyen  [133])  let 
f  :  R+  -»  U  ,  and  for  A  £  U  , 

J (A)  =  inf (t  i  0  |  f(t)  €  A)  . 

It  is  clear  that  J(A)  (the  hitting  time  of  A)  is  an  information 
measure  of  type  Inf.  If  we  extend  this  notion  of  hitting  time  to 
multi-valued  mappings,  then  all  information  measures  of  type  Inf 

can  be  constructed  this  way.  Indeed,  if  f  :  R+  -♦  »(U)  ,  then 

J(A)  =  sup { t  i  0  |  [  U  f ( s ) ]  n  A  =  0)  . 

OS  Sit 

Conversely,  if  J(A)  *  inf(r(u)  |  u  «  A)  ,  then  J  is  the  hitting 
time  of  the  set  f(t)  “  (u  «  U  |  *(u)  <  t)  . 

In  a  narrow  sense  of  information  theory,  the  concept  of  proba¬ 
bilistic  entropy  is  the  cornerstone  of  theoretical  investigation  of 
the  transmission  of  information  over  communication  channels.  More 
generally,  the  uncertainty  about  a  stochastic  system  can  be  measured 
by  the  entropy  of  the  random  vector  or  random  set  describing  the 
system.  When  the  probabilities  cannot  be  directly  evaluated  but 
some  evidence  about  the  system  is  available,  say,  in  the  form  of 
expectation,  or  higher  moments,  the  principle  of  maximum  entropy  (or 
minimum  cross  entropy)  (Jaynes  [123]),  extending  Laplace's  prin¬ 
ciple  of  insufficient  reason,  can  be  used  as  a  selection  criterion 
for  constructing  probability  distributions.  We  will  discuss  this 
principle  in  the  context  of  random  sets  in  5.4.  For  an  axiomatic 
foundation  of  the  principle  of  maximum  entropy  and  of  minimum  cross- 
entropy,  see  Shore  and  Johnson  [235].  For  the  applications  of  the 
maximum  entropy/  principle  to  statistics  and  statistical  mechanics, 
see,  e.g.,  Kampe  de  Feriet  [130];  see  also  Kullback  [145’]. 


Remark. 

We  present  an  example  connecting  basic  set  operations  as  formal 
language  descriptions,  semantic  evaluations,  logical  validity,  and 
probabilistic  and  informational  change  relative  to  Probability 
Logic.  (See  again  Chapter  2  for  background  especially  sections 
2.3.3  (C),  2.3.4,  2.3.5  and  2.3.9  (A).) 

For  any  i,j  «  Var(0b(£))  ,  letting 


m  ^  i  x  ft*  x  j 
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and  choosing  any  individual  variables  (projections)  x 


U)  „  («'  ) 


z  'u  ] ,  recalling  the  specially  designated  function  symbol  for  set 
m 

membership  (in  the  general  sense)  («  :  Ob (t)  -•  Rel(£))  ,  consider 

wff's 


*  <xmU)  €i  Ym(n  ]) 


*2  ~  €if<xm(i)«Zm(n  *> 

Suppose  that  (Z4)  unions  hold  as  either  an  axiom  or  theorem 
(see  2.3.4  or  [172],  pp.  70  -  86). 

-  «v  •,  -  *))  ■ 

«i  V”1’-  O' 

is  in  Wff  [t)  ,  for  all  i  in  Var(0b(4))  ,  where  y  ^  ^  , 

m 

a<Q  ^  ,  p^^  are  all  arbitrary  individual  variables  of  the  appro¬ 

priate  types,  where  we  define  via  class  abstraction 

,(fli)  o  t(Ri)  3  a(Ri)  . 

Alternatively,  we  can  use  the  class  abstraction  function  symbol 
{•!••}  and  a  natural  associated  tautology 

*-  6i  {^(Ri)  u  e(ftl,n  c=»  n )  . 

where  we  recall  from  2.2.2, 

({•!••)  :  Ob  (l)  x  Ob(i )  x  RelU)  Ar  (*)) 


(fi1)  ,,  „  (n1)  d  .  .  .  „i  .  , 

y  '  u  { '  '  =  (i  |  n  xj,/7) 

3  W.X)  ,  „  . 


using  Individual  variable  notation. 
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In  turn,  this  Implies  In  a  natural  way 

in1*  /  in1'  m1) ) 

1  s.  ,  v*  «-»  {  * , 

a* 

implicitly  defining  generalized  subset  relation  £  A  . 

n 

Specializing  these  results, 

„  '“‘l  2  y  I”1)  u  z  I”1'  a  .  y  <al>  .  z,1"1'  . 
m  Jm  m  i  m  m 

define 

r  =  e  [<x  (i* ,w  *>] 

3  i  m  m 

,  (i)  (n1), 

=  (Xm  Wm  * 

m  Ira 

-  rx  or  r2  . 

Let  ii  ii  :  L  -*  ( SET, H*  )  ,  H  =  [0,1]  be  a  semantic  evaluation 

for  PL.  Thus  Hill  is  some  a  -algebra  over  Xi  ,  some  ordinary  set, 

with  similar  remarks  for  ii  jii  ,  tinil  =■  <0,1}  and  im^it  =  a  <  <0 ,  l}11  }  , 

by  identification  nx  ^**H  :  limn  -♦  Hill  ,  lly  *il  :  ii  mil  -♦  ni2in  , 

m  m 

Hz  ^ a  *  ii  :  ii  mu  -*  nniii  ,  are  projections,  where 
in 

ii  mil  =  a  (II  in  x  imiH  x  II  jil  )  . 

Also,  recall  that  iie^H  :  iiixn^u  -*  H  is  a  probability  measure, 
where  ilixfi^il  =  ofllill  x  II 12 ^ II  )  .  Then 


Pr(r . )  =  (iix  '  Ml  ii «  .11  ii y  '  'll) 

l  in  l  m 

^  nr^i 

=  lie  .11  { 11  x  * 1  *11  x  lly  * n  *  II  ) 
i  '  m  xm  ' 

=  lie  .11  (ilx  * 1  *  II  (•  )  x  (lly  *1 

l  m  m 


(•  •  )  )  o  IIXm(1)ll  (•  )  ) 
in 


and  similarly, 


Pr(r2)  s  h^ii 


=  it e  ii o  ( ii x  '  'h  (>)x( H z  '  'll  (*  •  )  )°Hx  1  Mi(-))  . 
l  m  in  in 

But  compatible  with  the  definition  for  unions  and  Horn  , 


ii w  1,(1  *11  «  lly  ^ n  *11  U  llz  *li  :  II mil  -♦  II 12 1 11  . 

m  'n  m 
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Note  that  this  is  equivalent  to  the  sequents  (?1  »-  ?3)  , 

(*2  i-  ?3)  being  valid  relative  to  ••  H  as  used  here,  i.e., 

negative  information  increase  or,  equivalently  probability  increase 
on  wff's  is  the  same  as  validity  for  PL. 

Analagous  results  are  obtainable  for  other  set  operations  such 
as  complementations,  cartesian  products  and  sums,  and  more  general 
compound  set  operations. 
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3.2  Choquet-tvpe  ■easnres:  Basic  Properties 


In  this  subsection,  the  Choquet-type  family  of  uncertainty  mea¬ 
surer  is  p-erented.  The  following  development  is  based  on  the  works 
of  Choquet  [36],  Shafer  [228],  Dempster  [46],  as  well  as  Nguyen 
[187],  Goodman  [90]  and  Hohle  [116]. 

Let  X  be  a  fixed  base  space  with  a  £  *(X)  with  either  a 

closed  under  all  finite  intersections  -  indicated  by  a^  -  or  closed 

under  all  finite  unions  -  indicated  by  a^  .  Let  a  denote  gener¬ 
ally  either  situation.  Let  p  :  a  -*  [0,1]  be  an  uncertainty  mea¬ 
sure  over  a  for  X  .  We  say  p  is  monotone  1  ncreas  I  ng  iff 
V  C  £  D  «  a  ,  p  (C)  S  /v  ( D )  and  p  is  monotone  decreasing  iff 

v  c  £  D  e  a  ,  fj(C)  2  p(D)  .  In  addition,  let  (*,*)  =  (rt,u)  or 
=  (U,n)  .  Define  for  any  integer  nil,  the  Choquet  operators 

n 

^n.'n  :  x  *  -•  [0,1]  :  for  any 


?n  =  'B1 


,Bn)  ,  Bj  «  a  ,  j  *  i, . . . ,n  , 


'V  “  ^  (-l)Card  K.p(  *"b  *"b  ) 

K£  { 1 . n-1 }  jeK  3  n 


v  (p.»  ,*  , B  )  §  p(  *V) 

je  { 1 ,  .  .  .  ,n 


-  2  (-l)card(K)+i, 


(-1) 

}  0*K£{1 . n) 


Pi  *  B  ) 
j€K  3 


Definitions: 


Let  p  :  a  -  [0,1]  be  an  uncertainty  measure. 

P  is  a  plausibility  or  upper  probability  measure  iff  vn  and 
Bn  ,  ^n(/j,U,Bn)  S  0  &  p{0)  »  0  . 

p  is  a  belief  or  credibility  or  lover  probability  measure  iff 
vn  and  Bn  ,  ^n(p,n,Bn)  2  0  &  p(X)  =  1  . 

p  is  a  doubt  or  commonality  measure  iff  Vn  and  B 

~n 

^n(A'.u,Bn)  2  0  &  p[0)  =  1  . 

p  is  a  disbelief  or  Incredibility  measure  iff  ^(X)  =  0  &  Vn 
and  Bn  ,  A^p.n,  Bn)  SO. 


Remark : 

Following  Choquet’s  original  nomenclature  [36],  a  plausibility 
measure  is  a  "U-alternating  [Choquet)  capacity  of  "-order" ,  a  be¬ 
lief  measure  is  a  "n-monotone  Choquet  capacity  of  ®-order" ;  a  doubt 
measure  is  a  "u-monotone  Choquet  capacity  of  "-order" ,  a  disbelief 
measure  is  a  "n-alternating  capacity  of  »-order". 


322 


Goodman  and  Nguyen 


1  Fix  n  =  2  . 

If  VB2  ,  v2(A/,n,U.B2)  2  }  0  &  p(X)  =  {  J  or 

if  vb2  ,  ^2(/i,u>B2)  2  )  0  ,  then 

.  f  decreasing 

•*  is  mon°tone  [  increasing  * 

If  VB2,*2(/J,U,n,B2)  2  ]  0  &  fJ(0)  =  {  ° 
or  if  VB2  ,  A2(fj  ,n  ,B2)  2  |  0  ,  then 

"  is  monotone  (  decreasing  * 

lema  2  Fix  n  . 

If  p(X)  =  0  ,  then  vjBn  ,  vn(/i,n,u,Bn)  *  0  iff  VBn+1  , 

'W'i'u'SnHl)  *  0  • 

If  p<X)  -  1  ,  then  VBn,  vn{/j,n,u,Bn)  i  0  iff  vBn+1  . 

^n+1 ^ ,U '~n+l )  25  0  ‘ 

If  fj(0)  =  0  ,  then  VBn  ,  7n(Ai.U,n,Bn)  i  0  iff  VBfi+1  , 

Jn+1  ^ ,f1  '~n+l  >  2  0  • 

If  p(0)  =  1  ,  then  VBn  ,  vn(p  ,U  ,n ,  Bn)  S  0  Iff  VBn+1  - 

Jn+l(#/'n'5n+l>  5  0  • 

"  2  1 

Proof :  =»  :  Given  the  form  An+i^r*  '~n+l^  £  ]  0  choose 

B  =  *  oB  .  «=  :  Given  the  form  v  (V  ,*  .  *  ,B  )  2  0  ,  VB  , 

n+1  ~n  n  -  n  i  ~n 

replace  B^  by  B.*"  5n+1  »  j  =  l,...,n  ,  and  then  use  Lemma  1. 

Lena  3 

If  p  is  a  plausibility  measure  or  a  belief  measure,  it  is 
monotone  increasing. 

If  p  is  a  doubt  measure  or  a  disbelief  measure,  it  is  mono¬ 
tone  decreasing. 

Lemma  4 

p  is  a  plausibility  measure  over  iff  1  -  p  is  a  doubt 

measure  over  a^  . 
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p  is  a  belief  measure  over  iff  p[C- )  is  a  doubt  measure 

over  X  H  Sq  , 

fj  is  a  disbelief  measure  over  iff  1  -  p(c-  )  is  a  doubt 

measure  over  X  -4  ,  where 

X  1  ®n  =  (X  H  B  |  B  e  =  {CB  |  B  e  ,  etc. 

The  following  theorems  have  straightforward  proofs  which 
involve  some  messy  computations. 

Theorea  1 

Let  v  be  an  uncertainty  measure  over  }  (X)  (class  of  all 
finite  subsets  of  X)  or  over  X  H  } (X)  ,  whichever  is  appropriate. 
Then : 

(i)  is  a  doubt  measure  over  J  (X)  Iff 

fj(0)  -  1  &  VG  e  }  ( X)  ,  A'q1)  *  0  <  where  for  any  C  a  G  , 

=  Z  (-l)card(K,/J(C  D  K)  , 

KSGHC 

where  G  n  C  =  {x1>...,xm>  ,  Xj  «  X  ,  j  *  1 , . . . , m  . 

(ii)  (J  is  a  belief  measure  over  X  H  J  (X)  iff  ( X )  =  l  & 

v  G  «  J(X)  ,  (X  -4  C})  i.  0  ,  where  for  any  C  a  G  , 

p'xhg({XHC))  =  'dm+l(/1'n'X'l<xl} . XH(xm),X  h  C) 

=  Z  (-l)card(K)^(X  H  (CUK))  . 

KSGHC 

(iii)  p  is  a  plausibility  measure  over  J(X)  iff 

v(0)  =  0  fit  SO  ,  V  G  €  } ( X)  . 

(iv)  fj  is  a  disbelief  measure  over  X  H  }(X)  iff 

v  c  a  G  «  1  ( X )  ,  h  C})  s  0  . 

Theorea  2 

If  v  is  a  doubt  measure  over  j(X)  ,  then  v  G  e  J (X)  , 

(i)  0  S  S  1  ;  indeed  is  a  probability  measure  over 

**{G)  corresponding  to  random  subset  of  G  . 

G 
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(ii)  v  C  s  G  , 

X  A«^1)({B})  =  p(C)  =  Pr(C  <=  S{1))  =  /i^X)(c  (»(G))  , 
CSBSG  G 

where  c  (»(G))  =  { B  |  CSBSG}  is  the  superset  or  filter  class 
c 

on  C  in  »(G)  . 

In  particular, 

//(11){{  0))  -  z  (~l)card  -p(B)  ,  /j* X)  (  <e> )  =  p(G)  , 

BSG 

and  as  a  check 

#/*1)(®( G))  =  P(*>  =  :  ;  =  0  . 

( iii )  If  C  S  G1  s  G2 

/i*l)((C))  =  ^X)(CC(»(G2)))  , 

and  more  generally,  for  any  A  £  9(G^)  , 

/j^Ua)  =  x  a»I1) ( <c> )  =  ^1)(pi1(-«)  a  (G,))  J 

G1  Ce*  Gi  G2  G1  2 

p~V)  Q  <g2>)  3  {C  I  c  s  g2  &  c  n  G1  €  A)  , 

where  pQ  Is  the  standard  projection:  pQ  (A)  =  G^  n  A  .  (See 

section  4.2.) 

Remark. 

For  Theorem  2  above  and  for  the  following  two  theorems,  all 
results  involving  probability  measures  over  classes  of  sets  may  be 
reinterpreted  directly  in  terms  of  random  sets.  (See  Chapter  4.) 


Theorem  3 

If  p  is  a  belief  measure  over  X  ±  J  (X)  ,  then  V  G  e  J (X)  , 
[  2  ) 

(i)  P„  '  is  a  probability  measure  over  ?*(X  -*  G) 
XHG 

(  2  ) 

corresponding  to  a  random  subset  S'  '  of  X  4  G  . 

XHG 


il)  v  c  s  G  , 


5  /'x4G((B>)  =  H(X  H  C)  =  s  r  H  C) 


XHG 


(XHGSBSXHC) 


,(2) 


P^Hq(®X-,c(?.(X  -*  G )  )  )  ,  where 
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I 


=  {X  h  B  |  B  £  G}  and  pj^(a)  *  for  any  a  =  X  ±  x  , 
x  £  »(G)  ,  is  defined  analagous  to  Pg1^-*)  . 

Proof :  Use  Theorem  2  and  Lemma  4 . 

Remark 


Analogues  of  Theorems  2  and  3  for  plausibility  and  disbelief 
measures  may  also  be  established,  utilizing  Lemma  4.  An  alternative 
approach  to  the  uncertainty  measures  discussed  above  is  presented  in 
section  4.3,  where  Choquet's  capacity  theorem  is  used. 


Theorem  4 

(i)  Let  a  be  a  fixed  Boolean  ring.  Then  p  is  a  finitely 
additive  probability  measure  over  a  iff  p  is  both  a  plausibil¬ 
ity  and  a  belief  measure  over  a  . 


(ii)  If  p  is  a  semi-distributive  t-norm  possibility  measure, 
i.e.,  p  is  a  monotone  and  V  C,D«  ,  p(CU  D)  “  p( C)  +p(D)  - 

<}>&(p(C)  ,p(D) )  ,  for  t-norm  <{>&  ,  then  p  is  a  plausibility  measure. 

(iii)  If  is  a  semi-distributive  t-norm,  then  for  any 

A  «  *(X)  ,  PA  &(*)  -  ( <(» A 1 3C )  )  :  J(X)  -♦  [0,1]  is  a  doubt  mea- 

x«  ( •  ) 

sure . 


(lv)  If  $  is  a  semi-distributive  t-conorm,  then  for  any 

A  «  *(X)  ,  PA  or(-)  ~  $or  (<1>A(x)  )  :  J(X)  -  [0,1]  is  a  plausibility 

x«  ( •  ) 

measure . 

(v)  If  <|>&  ,  <}>or  are  a  DeMorgan  pair  with  respect  to 
*nt  3  1  '  (,)  '  then  ^A.or  =  1  -  "CA,&  * 


(vi)  In  general,  no  p  is  both  a  finitely  additive  proba¬ 
bility  measure  and  a  t-norm  possibility  measure. 


Remark . 

Recall  (see  section  2.3.5)  that  a  dispersion  space  (X,A,p) 
consists  of  J(X)  £  x  £  *(X)  and  p  :  A  -»  [0,1]  ,  an  uncertainty 
measure,  p  can  arise  as  simply  a  given  function  or  p  may  be  de¬ 
rivable  from  an  initial  function  such  as  from  $A  :  X  -►  [0,1]  and  a 

fixed  function  such  as  a  t-norm  (inf,  e.g.)  or  t-conorm  (sup,  e.g.) 
as  given  above  in  Theorem  4  (ill),  (Iv).  In  the  latter  case,  p  may 
be  thought  of  as  an  extension  of  $A  ,  with  the  identification 

p(  (x) )  «  A ( x )  ;  all  x  «  X  . 
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3.3  Possibility  and  related  measures. 

The  notion  of  possibility  distributions  was  introduced  by  Zadeh 
[281]  for  the  analysis  of  situations  in  which  the  uncertainty  is  not 
statistical  in  nature,  especially  in  "humanistic"  systems. 

With  the  notation  of  3.2: 


Definition. 

Let  a  £  »(X)  and  p  :  a  -*  [0,1]  be  an  uncertainty  measure. 
Then  p  is  a  t-possibility  measure  iff  p  is  a  monotone  measure 
and  U  B2)  =  p(B1)  +  /i(B2)  -  t(p(B1)  ,/j(B2>  )  for  all  Bj^  e  a  , 

where  t  is  a  (mutually)  distributive  t-norm  (t  *  <|>r)  .  In  particu- 

of 

lar,  Zadeh's  possibility  measure  p  (B)  =  sup  U  (x)  for  any  B  £  X 

°  X€B 

and  given  U  ,  where  :  X  -*  [0,1]  is  any  dispersion  (or  fuzzy  set 
membership  function  or  possibility  distribution  function)  is  a 

t-possibility  measure,  where  t  ^  a  (min)  .  (See  Klement  [140]  or 
others  for  extensions  of  Zadeh's  theory.)  Note  that  the  possibility 
measure  satisfies  the  following  property: 

For  any  index  set  I  ,  and  A^  £  X  ,  lei  , 

u  A.)  =  sup  (p( A.)  |  i  «  I)  . 
i«I  1  1 

Conversely,  given  any  t-possibility  measure  fj  the  associated 
possibility  distribution  function  is  defined  to  be: 

%  :  X  -  [0.1]  , 


where 


ff^(x)  3  p((x))  ;  all  x  «  X  . 

When  X  is  a  topological  space,  a  possibility  measure  p  can 
be  a  precapacity  or  even  a  special  Choquet  capacity?  for  example 

p(A)  =  1 / J ( A)  where  J(A)  =  inf($(x)  |  x  «  A)  ;  A  £  X  ,  with 

<}>  :  X  -♦  R+  being  lower-semi-continuous . 

In  an  approach  to  the  problem  of  meaning  representation  in  na¬ 
tural  language  (Zadeh  [283]),  the  law  which  governs  the  variable  V 
in  a  proposition  of  the  form  <V  is  A>  ,  A  e  ?(X)  ,  is  defined  as 
a  possibility  distribution  ff  =  <J>^  ,  expressing  the  degree  to  which 

each  element  x  e  X  "belongs"  to  the  generalized  set  A  .  It  should 
also  be  noted  that,  in  the  formulation  of  the  concept  of  values  for 
non-atomic  games,  Aumann  and  Shapley  [9]  are  led  to  consider  "ideal 
sets"  which  are  formally  fuzzy  sets  in  the  sense  of  Zadeh. 

We  specialize  now  the  general  formulation  of  conditioning,  pro¬ 
jections  and  Interactions  (see  section  2.3.8)  to  the  case  where 
<P&  =  a  and  $or  =  v  .  (See  also  Nguyen  [188].) 

Recall  that  (section  2.3.8)  if  A  «  7(XxY)  and  B  «  ?(X)  , 

then  the  conditional  generalized  set  (A  |  x  «  B)  «  5(Y)  exists  for 
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all  x  «  X  provided  sup  $.{x,y)  s  $n(x)  ,  Vx  €  x  ,  and  t-norm  <t> _ 

yeY  A  B  & 

is  continuous.  Furthermore,  denoting  truth  content  by  n  il  , 

ii  (ye  (A  |  xe  B))  &  (xe  B )  ll  =  ll(x,y)  e  All  , 

i.e.,  assuming  truth  functionality, 

U)  (y)  .  <f>R(x))  *  <})A(x,y)  ,  v  x  e  X  . 

Also,  the  X-projection  of  A  is  defined  as 

(2)  Voj„(A)(x)  =  *or{Vx'y))  '  V  x  €  X  ' 

ye  Y 

and  the  two  projections  proJx(A)  and  projy(A)  ,  are  said  to  be 
non-interactive  iff 

(3)  A  =  projx(A)  x  projy(A)  . 

When  <f>&  =  a  and  $or  =  v  ,  (1),  (2)  and  (3)  become,  respec¬ 

tively: 

(4)  $  (y)  a  (}>  (x)  =  $.(x,y)  ,  V  x,y  e  X  x  Y  , 

( A  |  xe  B )  B  A 

{5)  Vojv(A)(x)  =  SUP  VX'y>  '  V  X  e  X  , 

a  y®  y 

(6)  V*'Y)  =  Vojx(A)(x)  A  Vojy(A)(y)  *  v  x,y  e  X  x  Y  . 

An  explicit  form  for  4*(a|xsB)  (y)  is  9iver*  as  follows: 

*A(X*y)  '  lff  Vojx(A)<X>  *  VojY(A)(y) 

*A<X'V>  •  Vo3Y(A)(Y)  '  Vojx(A)(x) 

lff  ^projx(A)(x)  <  ^projy(A)(y)  1 

Now  let  ^  be  a  possibility  measure  with  associated  possibil¬ 
ity  distribution  function  n  =  n ^  .  In  constrast  to  probability 

theory,  the  non-interaction  of  variables  has  to  be  defined  (as 
above)  directly  from  possibility  distributions  and  not  from  possi¬ 
bility  measures.  This  is  due  to  the  fact  that 

AM  A  n  B)  »  m(A)  a  y(b)  does  not  imply  necessarily  one  of  the 
following  relations: 

(i)  fJ  ( A  n  CB)  =  fj(  A)  a  a#  (  CB )  , 

(ii)  fJ  ( CA  n  B)  =  CA)  a  ji(B)  . 

(iii)  ^(CA  n  CB)  =  fj(  CA)  a  *i(CB)  . 


^  ( A  |  xe  B )  ^ y  ^ 
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The  extension  of  the  domain  of  possibility  measures  to  include 
generalized  sets  is  as  follows: 

For  A  «  9(X)  , 

A)  ^  sup  {n  (x)  a  <|»A  ( x )  |  x  e  X)  . 

Another  extension  is  based  upon  integration  with  respect  to  capa¬ 
cities  : 

For  A  «  9 (X)  ,  we  have 

A  r1 

A<(A)  =  sup  {JT{x)  I  x  €  A}  =  A<(At)  dt 

J0 

where  we  note  p( A^)  is  well-defined  for 
A,.  ^  {x  «  X  I  4>a(x)  >  t)  . 

For  the  connections  between  dispersions  or  possibility  distri¬ 
butions  and  probability  theory,  via  random  sets,  see  Chapter  4. 

We  mention  now  a  problem  of  inference  using  possibility  theory. 
Let  U  be  a  variable  taking  values  in  X  with  possibility  distri¬ 
bution  ffy  =  <}>A  induced  by  the  proposition  <U  is  A>  .  From  this 

proposition,  one  can  infer  that: 

Poss ( U  is  B)  =  ( B )  -  sup  {<j>A(x)  A  4>b(x)  I  X  €  X)  . 


This  inference  rule  can  be  interpreted  intuitively  as  follows: 
For  each  x  «  X  ,  the  dirac  measure  is  a  possibility  measure 

with  the  following  Interpretation  of  conditioning:  Knowing  that  x 
is  the  only  possible  v?1 le,  the  conditional  possibility  measure  of  a 


crisp  set  B  is  6X(®) 
More  generally, 


i.  iff  x  «  B 
0  iff  x  e  B 


r  i 

if 

A  n 

B 

*  0 

«a(B)  = 

i 

i  0 

if 

A  n 

B 

,  for  any  ASX 

=  0 

Note  that 


VB)  4 


sup  {<}>A(X)  A  $B(X) 


X  €  X)  . 


Again,  it  is  natural  to  extend  conditional  possibility  measures  to 
generalized  sets  by  the  same  formula,  replacing  A  and  B  . 

Now  if  U  is  a  random  variable,  defined  on  the  probability 
space  (n,A,Pr)  ,  with  probability  distribution  Py  on  X  ,  and  V 

is  a  (fuzzy)  variable  taking  values  in  Y  depending  on  U  in  the 
following  way:  given  that  U  =  x  ,  the  strict  range  of  V  is  some 
subset  T  of  Y  .  Thus, 
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Poss(V  =  y  |  CJ  =  x) 


'  1 

if  y  «  T. 

.  0 

otherwise 

and  Poss(V  |  U  =  x)  <=  or  more  generally  Poss(V|U)  =  *}>ToU  , 

x 

where  o  is  functional  composition. 

If  we  identify  Py  with  (U,Py)  and  X  with  Poss(V|U)  ,  we 

can  consider  an  evidence  €  ^  (Py,Poss(V|U) )  .  Note  that 
Poss(V|U  =  x)  is  a  random  set  in  Y  .  It  is  easy  to  see  that 


Poss(V|U  =  x)  “  0  «=>  Tx  =  0 

ASY,AnT  *0«s»An  Poss(V|U  =  X)  *  0 
Tx  £  A  «=*  Poss ( V|  U  =  x)  £  A  . 

Therefore 

Pr,(A)  «  Pr(Poss(V|U)  £  A) 


Pr  (A)  «=  Pr(Poss(V|U)  n  A  *  0) 

* 

where  Pr,  ,  Pr  are  lower  and  upper  probabilities  (Dempster  [46]) 

associated  with  (O.^.Pr)  and  the  multi-valued  mapping  Poss(V|U)  . 
(See  also  Chapter  4,  especially  Theorems  2  and  3.) 

The  random  set  Poss(V|U  *  x)  induces  a  possibility  measure 
defined  by 

Poss(V  «  A  |  U  =  x)  *  sup  (Poss(V  »  y  |  U  »  x)  |  y  €  A) 


or  a  random  possibility  measure: 


Poss(V  e  A  |  U) 


■  l  if  A  n  tx  *  0 
.  0  otherwise 


Note  that  Ey(Poss(V  c  A|U))  =  Pr*(A)  where  Ey  denotes  probabil¬ 
istic  expectation  with  respect  to  the  random  variable  U  . 

The  concept  of  conditional  possibility  measure  can  be  used  in 
the  analysis  of  evidence  (see,  e.g..  Zadeh  [282]).  For  example, 
consider  the  problem  of  making  inference  about  probability  laws  of 
random  variables.  As  before,  let  U  be  a  random  variable  with 
values  in  a  measurable  space  (X,c)  .  Denote  by  D  the  set  of  all 
probability  densities  on  X  .  By  an  evidence  e  ,  we  mean  proposi¬ 
tions  which  specify  that  the  true  density  fQ  of  U  lies  in  some 

subset  D(e)  of  D  .  The  subset  D(e)  is  referred  to  as  the  con¬ 
straint.  The  Inference  procedure  consists  of  finding  an  estimate  of 
f  based  upon  the  information  contained  in  the  evidence  c  .  As  a 

classical  example,  let  X  be  the  state  space  of  some  stochastic 
system,  and  e  is  expressed  in  the  form  of  a  constraint  on  the 
expectation  of  U  ,  say,  E(U)  -  a  .  The  evidence  e  specifies  the 
subset : 


D(6)  ■  (f  «  D  |  J  xf(x)  dx  =  a)  . 
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An  estimate,  say  f  ,  of  f  can  be  obtained  by  using  the 

principle  of  maximum  entropy  (Jaynes  [123],  see  also  section  5.4). 
If  B  «  c  ,  we  infer  that: 

Pr(U  e  B  |  t  )  =  f  f (x)  dx 
JB 

Now  if  the  evidence  t  is  of  the  form  <U  is  A>  such  that 
<|>A  is  measurable,  then  e  does  not  specify  the  true  density  f0 

but  does  specify  a  weaker  law,  namely,  a  possibility  distribution 
expressed  as  *  ^A  '  and  in  ^is  sPirit»  B  is  another  fuzzy 

event,  i.e.,  B  e  *(X)  ,  with  <[>B  measurable,  we  have: 

Poss(U  is  B  |  t)  =  p(B  |  A)  =  fj(B  |  ffy  =  4»a) 

■  sup  (4>a(x)  a  4>b (X)  I  X  €  X)  . 


Relations  between  possibility  theory  and  other  uncertain 


■eas ures . 

First,  the  concept  of  fuzzy  measures  is  given  by  Sugeno  [248'] 
as  follows: 

Let  (X,A)  be  a  measurable  space.  A  mapping  g  :  A  -*  [0,1] 

is  called  a  fuzzy  measure  iff 

(I)  g(0)  *  0  ,  g(X)  *  1  . 

(ii)  A.B  e  a  ,  A  £  B  =*  g(A)  S  g(B)  , 

i.e.,  g  is  monotone  increasing,  and  g  satisfies  the  following 
monotone  continuity  condition: 

(iii)  A  «  a  and  (A  )  f  (resp. , (A  )  4)  implies 

n=l,2,  ...  n=l , 2 , . . . 


g(  U  An)  =  lim  g(An)  . 
n=l  n-*-K» 


(resp.,  g (  n  A  )  =  lim  g(A  ).) 

.  _  n-*+«> 

n=l , 2 ,  .  . . 

Secondly,  Sugeno  defined  a  A-fuzzy  (or  A-additive)  measure  as 
the  mapping 

gA  :  *  -*  E  0 , 1  ]  ;  -1  <  A  <  +  » 

such  that 

(a)  gA(X)  =  1  . 


(b)  gA(A  U  B)  *  gA(A)  +  gA ( B )  +  A • gA (A)gA (B)  . 
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(c)  gA  satisfies  condition  (ill)  above. 


It  is  easy  to  verify  that  (see,  e.g.,  Dubois  and  Prade  [51]). 
(a)  All  probability  measures  are  fuzzy  measures. 

(fi)  Any  A -fuzzy  measure  is  a  fuzzy  measure. 


{■>) 

Banon 

[11]  has  shown. 

the  following: 

Let  g 

A  be  any  A-fuzzy  measure.  Then 

(i) 

If  A  z  0  ,  gA 

is  a  belief  measure. 

Ui) 

If  -1  <  A  s  0 

,  gA  is  a  plausibility 

measure . 

(ill) 

If  A  =  0  ,  gA 

is  an  ordinary  probability  measure. 

<«) 

Kruse 

[146]  developed 

an  extension  theorem 

for  A-fuzzy 

measures,  and  showed  that  the  unique  translation  invari¬ 
ant  a -fuzzy  measure  over  e  (=  B[0,1])  ,  the  o-algebra  of 
all  Borel  subsets  of  [0,1]  ,  is  given  by 


d  1  vol  (A) 

gA(A)  =  f ({!  +  A)  1 


-  1) 


A  e  C  , 


where  vo^  is  Legesgue  measure  over  [0,1]  . 

(e)  Banon  [11]  also  showed  that  a  (Zadeh)  possibility  mea¬ 
sure  is  a  belief  measure  (or  a  A-fuzzy  measure)  iff  it 
is  a  dirac  (i.e.,  one  mass  point)  measure. 

(See  Banon  [11]  for  a  summary  of  various  relations 
for  uncertainty  measures.) 

However,  as  shown  in  Puri  and  Ralescu  [206],  in  general  a 
(Zadeh)  possibility  measure  is  not  a  fuzzy  measure. 

Finally,  note  that  the  concept  of  probabilities  of  fuzzy  events 
(Zadeh  [283])  is  formulated  in  Klement  [139']  as  follows: 

Again,  let  (X,A)  be  a  measurable  space,  and  let  c  be  the 
Borel  o-algebra  of  [0,1]  .  Denote  o  [x)  the  fuzzy  o-algebra 

generated  by  A  ,  i.e.. 


o(a)  =  (f  |  f  :  X-*  [0,1]  (  f  is  x  -  C  measurable)  . 

A  fuzzy  probability  measure  is  a  mapping  m  :  o(ji)  -*  [0,1] 

such  that 


(i) 

m(0)  -  0  , 

m(  1 )  * 

1  (for  the  0-  and 

functions ) 

(il) 

modularity 

holds , 

i.e.  , 

v  f,g  « 

o  (x)  , 

m{ f  v  g) 

+  m(  f 

a  g)  =  m(f)  +  m(g)  , 
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(ill)  increasing  continuity  holds,  i.e., 

if  f  «  a (A)  and  f  t  f  then 
n  n 

m(fn)  t  »(f)  • 

If  m  :  o  (a)  -*■  R  satisfies  (ii),  (iii)  and  m(0)  =  0  ,  then 
m  is  called  a  fuzzy  measure  in  Klement's  sense.  It  should  be  noted 
that  this  concept  of  fuzzy  measure  is  different  from  Sugeno's  fuzzy 
measure  in  general.  The  characterization  of  this  type  of  fuzzy  mea¬ 
sures  is  given  in  Klement  [139']:  if  m  is  a  finite  fuzzy  measure, 
then  there  exists  a  unique  finite  measure  P  on  (X,jI)  and  a  P  - 
almost  everywhere  unique  Markoff -kernel  K  such  that: 

V  f  «  a(A),  m(f)  =  J  K(x, [0,f (x) ) )  dP(x)  . 

xeX 

(See  also  Klement,  Schwyhla  and  Lowen  [141].) 


Remark . 


From  a  semantic  evaluation  viewpoint,  the  above  investigation 
can  be  enlarged  by  replacing  a  ,  v  by  other  t-norms  and  t-conorms. 


CHAPTER  4 


RANDOM  SET  THEORY:  INTRODUCTION 


This  chapter  contains  necessary  background  on  the  theory  of 
random  sets  which  will  be  used  in  Chapters  5  and  6  to  establish 
formal  connections  between  possibility  and  probability  theories. 


4.1  Background . 

By  random  sets  we  mean  random  elements  whose  possible  outcomes 
are  subsets  of  some  given  space.  Random  sets  often  arise  in  statis¬ 
tical  problems  in  the  form  of  families  of  confidence  sets.  In 
sampling  from  a  finite  population,  a  sampling  design  is  precisely  a 
random  set.  The  theory  of  random  sets  has  been  mainly  developed  by 
Kendall  [137]  and  Matheron  [171]. 

Let  (n.jt.Pr)  be  a  probability  space,  and  X  be  an  arbitrary 

set.  Let  *  s  *(X)  ,  and  «  be  a  o -algebra  on  t  .  A  random  set 
S  is  ah  x  e  measurable  mapping  from  (a.A.Pr)  to  (c,c)  .  The 

induced  probability  measure  on  (C  ,C)  is  Pg  «  Pr°S  1  .  As  in  the 
case  of  random  vectors,  given  a  probability  space  of  the  form 
(c  ,c  ,Pg)  ,  id  :  C  -*  C  is  referred  to  also  as  a  random  set.  When  X 

is  arbitrary,  C  can  be  9 (X)  ,  and  c  =  oja)  ,  the  o-algebra  gen¬ 
erated  by  a  ,  where 

a  3  {m( 1,1*)  |  1,1*  e  }(X)> 

where  J(X)  ,  or  simply  J  ,  is  the  class  of  all  finite  subsets  of 
X  ,  and 

M{  I ,  I ' )  =  (A  «  a (X)  |  A  2  I,  A  n  I*  =  0} 

When  (X,»)  is  a  measurable  space,  one  might  take  c  =  a  ,  and 

hence  c  is  a  o-algebra  on  a  .  In  practice,  X  will  be  a  locally 

compact  Hausdorff  and  separable  space,  e.g.,  and  a  is  its 

Borel  o-algebra.  In  this  topological  setting,  compact-convex-valued 
random  sets  generalize  random  vectors  (when  restricting  to  singleton 
sets) .  Some  natural  extensions  of  results  for  random  vectors  have 
been  obtained  for  random  sets  -  for  mostly  compact-valued  random 
sets;  other  analogues  remain  to  be  established.  When  the  base  space 
X  is  a  metric  space,  the  class  of  non-empty  compact  sets 
£•  ■  X  {0}  is  a  metric  space  with  the  Hausdorff  distance,  so  that 
compact-valued  random  sets  are  measurable  mappings  taking  values  in 
(*•,*(*•))  ,  where  »(*')  is  the  Borel  o-algebra  of  *'  .  The  fact 
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that  **  is  a  metric  space  allows  us  to  consider  weak  convergence 
of  probability  measures  on  sc*  and  study  asymptotic  problems,  e.g., 
central  limit  theorems.  Also,  the  integration  of  multi-valued 
mappings  (see,  e.g.,  Debreu  [45],  and  section  4.4)  can  be  formulated 
so  that  the  problems  of  the  law  of  large  numbers  for  random  sets  can 
be  addressed. 

When  X  is  locally  compact,  the  class  y  of  closed  sets  of  X 
can  be  topologized  in  a  suitable  way  (Matheron,  [171])  to  yield  a 
compact  space,  and  closed  random  sets  can  be  defined.  For  these 
random  sets,  incidence  functions  play  the  role  of  distribution  func¬ 
tions  of  random  vectors.  Specifically,  the  incidence  function  Tc 
of  the  random  set  S  ,  a  special  type  of  Choquet's  capacity,  is  de¬ 
fined  as  follows: 

T0  :  *  -  [0,1]  , 

TJX)  =  Pr(S  n  K  *  0)  , 

(Kendall  [137]  uses  the  terminology  "trapping  functions".)  Note 
that  T„  characterizes  an  equivalence  class  of  closed  random  sets 
(Matheron,  [171]). 

In  the  case  of  (X,o(3))  ,  the  space  law 
T0  :  J(X)  -  [0,1]  defined  by 
T0(I)  ^  Pr(S  n  I  *  0)  ;  I  «  J  (X)  , 

characterizes  also  an  equivalent  class  of  general  random  sets. 

In  this  book,  we  will  mainly  consider  random  sets  on  (*(x), 
o(a))  .  A  characterization  of  these  random  sets  will  be  given  in 

terms  of  probability  coverage  functions  which  are  relevant  to  our 
analysis  of  uncertainty  (see  Chapter  5).  However,  to  compute  ex¬ 
pected  measure  of  random  sets  and  higher  moments,  since  elementary 
proofs  are  not  available,  we  will  call  upon  topological  structures 
and  abstract  measure  theory. 

The  reader  interested  in  the  development  of  random  set  theory 
towards  statistical  applications  can  find  main  references  on  these 
issues  in  the  bibliography  at  the  end  of  the  book.  We  will  not 
discuss  random  sets  arising  from  stochastic  point  processes  or  geo¬ 
metric  covering  problems.  However,  we  will  say  a  few  words,  in  5.5, 
about  randomized  set  estimators  (and  tests)  because  of  the  analogy 
in  spirit  with  the  investigation  we  are  going  to  carry  out. 

The  fact  that  random  set  theory  can  be  viewed  as  a  tool  toward 
providing  a  unified  treatment  of  uncertainty  measures  is  the  main 
reason  why  it  will  occupy  a  large  portion  of  this  book. 


4.2  Structure. 

In  this  section,  we  study  structures  of  random  sets  and  their 
connections  with  uncertainty  measures.  As  stated  before,  random 
sets  are  essentially  set-valued  random  variables  or  vectors. 

Ordinary  random  variables  are  singleton-valued  random  sets.  More 

rigorously,  consider  the  following:  The  natural  topology  on  (0,1  }X 
is  y (X)  ,  the  product  topology  induced  by  the  discrete  topology 

over  each  factor  space  (0,1)  .  y  (X)  is  a  compact  Hausdorff, 

totally  disconnected  (and  separable,  if  X  is  at  most  countable) 
topology.  It  follows  that  the  same  properties  hold  for 
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r<*o)  £  4>-1(r  (X) )  Q  aQ  , 

the  natural  topology  for  power  class  s&Q  ,  where  it  is  always 
assumed  that 

0,X  e  aQ  s  9(X)  . 

A  typical  open  set  for  *(X)  is  an  arbitrary  union  of  sets  of  the 
form 

Va  1  (f  I  '  *  S  ,tx3>  '  •]  '  J  *  V  ■■ 

Xj  €  X j ,  a^  «  {0,1},  mil. 

A  typical  open  set  for  r(»Q)  is  a  union  of  sets  of  the  form 

♦'"'Va1  2  »o  ”  V  n  ’WUHC'I  ‘  MIC-G'  H  C'  )  ( extended  1  ; 

f«|x1i.i...).y!S'.(xJ|ii  i.)  . 

where  the  following  is  defined:  Let  B  «  »(X)  be  arbitrary  fixed. 
The  superset  coverage  class  on  B  ,  which  becomes  the  one-point 
coverage  class  on  x  when  B  *  (x> ,  x  «  X  ;  the  subset  coverage 
class  under  B  ;  and  the  Incidence  (or  trapping)  class  relative  to 
B  «  *(X)  ,  respectively; 

CB  £  <C  |  B  S  C  «  *Q>  ;  *B  £  {C  |  0  *  C  S  B  ,  C  €  *o>  ; 
eB  £  {C  I  C  A  B,  C  e  *o>  . 

The  above  concepts  are  closely  related  to  that  of  filter 
classes.  Indeed,  relative  to  »D  ,  subset  coverage  classes  are  all 

ultrafilters.  (See  the  remark  at  the  end  of  the  appendix  of  this 
section) . 

CB  ,  $B  ,  and  «B  are  all  closed-compact  for  r(»Q)  •  If 

B  «  J  (X)  ,  then  additionally  they  are  all  open.  Define  also  the 

special  projection  map 

PG  :  9(X)  9(G)  ;  pQ  ( C )  £  C  n  G  ;  all  G  «  J(X)  ,  C  €  9  (X)  . 

Then  letting 


*<V  '  ~.,V,W  !  ’a 'S'  3  P a1'"'0”  a  »< 

J  { X ) 


the  first  being  a  Boolean  ring  over  aQ  ,  the  second  a  o-algebra  for 
each  G  ; 
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r(»o)  =  top(e(ao))  -  o(«(ao))  =  o(c{x}  I  x  e  X)  £  o(r(aQ)) 

=  o(c1(ao) )  » 


where  ci(aQ)  is  the  class  of  all  compact  sets  relative  tc  r(aQ) 

In  addition,  it  easily  follows  that  cg  ,  aB  ,  eg  e  a (‘ »(aQ))  ;  all 

B  €  a(X)  .  All  the  above  equations  and  all  related  results  carry 

Y 

over  to  {0,1}  and  9"  { X )  via  the  membership  function  <j>  . 

A  primitive  random  set  (see  also  Wang  and  Sanchez  [262])  is  a 
mapping  SQ  ,  where  for  given  probability  space  (n,A,Pr)  and  any 

c ,  assumed  «  »Q  ,  all  x  e  X  , 


S  :  1 2  -  a  ,  S  -1 (c  )  *  A  ;  all  X  €  X  . 

O  O  O  (X) 

Any  primitive  random  subset  SQ  of  X  can  be  extended 

uniquely,  using  the  above  equations  and  standard  measure  extension 
procedures  to  correspond  to  the  complete  regular  Borel  probability 
space  (with  regularity  involving  finite  coverages) 

<V  yS  >  '•  Vs  3  Pr°So_1  ■ 

o  o 

This  extension,  also  denoted  as  SQ  ,  is  called  a  random  subset  of 
X  with  range  »Q  generated  by  the  primitive  random  set.  Corres¬ 
ponding  to  the  above  coverage  classes,  SQ  generates  coverage 

functions  (by  computing  probabilities  of  coverages). 

Note  the  distinction  between  specifying  a  one-point  coverage 

function  Pr(x  e  s  )  =  v  (C,  .)  as  a  function  of  x  ,  and  S 

o  %»o  \xj  o 

itself  as  a  primitive  random  set.  The  former  in  general  is  highly 
nonunique.  Thus,  for  any  C  «  v(X)  , 

Pr(S  «  C_)  =  Pros-1 (C  )  =  Pr(C  £  S) 

v  C 

as  a  function  of  C,  p.-.lC),  is  the  subset  coverage  function  /v 

v » /  (  o  ) 

for  S  .  In  particular,  for  C  =  (x)  ,  x  e  X  ; 

Pr(S  €  C(x)}  =  Pr{x  €  S) 


as  a  function  of  all  x  «  C 
M(S) ( (• > )  for  S  ; 


is  the  one  point  coverage 


Pr ( S  e  *c) 


as  a  function  of  C, 


(S) 


(1) 


for  S  ; 


=  Pr®S-1  (®c )  *•  Pr(0  *  S  £  C) 
p  .  ..(C)  is  the  superset 


coverage  function 
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Pr(S  «  ec) 


ProS_1(«c) 


Pr (S  A  C) 


as  a  function  of  C,  p  ,,.(C)  is  the  incidence  function  p  , 

(S)^J  (  S )  '  2 

of  S  ;  Pr(S  «  =  Pr°s  ^t®^)  =  Pr(s  *  c)  •  as  a  function  of  C  , 

f/3(C)  ,  is  the  compl  ement-l  nc  idence  function  (J^s^{3)  of  S  . 

The  basic  construction  of  random  sets  will  now  be  given  in 
detain  in  Theorem  1 . 


Theorem  1  (Construction  of  Random  Sets  from  Primitive  Random  Sets). 

If  SQ  is  a  primitive  random  subset  of  X  with  range  a 
then  Sq  induces  the  unique  probability  space  (»o,o (Cq(»o) ) ,v  )  , 

where  i/g  can  be  uniquely  extended  in  stages  over  the  nested 
o 

o-algebras  o( c0(®0))  S  0(c1(aQ))  s  ,  the  Baire,  Borel , 

and  complete  o-algebras,  respectively. 


Proof : 

The  construction  of  v  proceeds  as  follows: 

o 

(I)  Define  for  each  x  e  X  ,  (C.  )  -  Pr(S_1(c  (a  ))  and  more 

So  {x}  °  (x)  ° 


generally,  for  any  C 


.(C)  S*  v_  (C  (a  ) 
(So)  so  C  o 


Similarly,  define  the 

V(S  )  3  (v  (S  ) )  „ 

-  o'  x  o  x«  X 


«  J(X)  by 

a  *  pri»cs°1"<x» 

collection  of  0-1  random  variables 
through  the  joint  probability  relations, 


<v>> 


Pr (  &  (V  (S  )  =  1))  =  tJls  .(C)  ,  C  «  >(X)  . 

xec  x  °  l&o' 

Note  that  p .  _  ,  (C)  as  a  function  of  C  is  a  doubt  measure 
o 

over  j(X)  ,  since,  for  B^-.-.B  e  j  (X)  arbitrary, 


n-1 


,-l, 


n-1 


°  5  "<S„>{C8„  H  1  *  Pr(So  <CB  H  U  ,C,Bhub  »> 

o  n  j=l  j  n  n  j=l  j  n 


Bristle  )  H  Vs;‘(c  I) 

n  j=l  J  n 
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-1  n~  ^  -i 

“  1  -  pr((«  -*  SQ  NCgJ)  0  £Sq  (e  J) 

=  Pr(s"1(c  ))  -  5  (-l)card  K+1Pr(  n  S^(c  )) 

°  Bn  0»KS{l,...n-l)  j«K  BjUBn 


=  Pr(So1(CB  ))  +  2  (-1)  —  “  •  Pr ( S^-1  (C 


card  K 
n  0*KS{1,  .  .  .  ,n-l) 


,-l, 


o  '  UB.UB 

n 


)  ) 


=  V"(So)'U'!n> 


(II)  Thus,  by  part  (I),  V ( 5?q )  is  a  well-defined  (consistent) 

0-1  stochastic  process  uniquely  determined  by  A'/o  \  •  Then 

o 

there  is  a  unique  probability  measure  v.  .  which  extends  v.„  , 

to  the  Baire  and  Borel  o-algebra  levels,  the  latter  yielding  a  regu¬ 
larity  property ,  and  may  be  further  extended  to  the  complete 
o-algebra.  (This  is  due  to  standard  results  as  may  be  found  in, 
e.g.,  Halmos  [104*]) 


(III)  By  employing  the  fundamental  one-to-one  onto  mapping 

V 

$  :  »(X)  -♦  (0,1)  ,  cut  down  to  a  ,  all  of  these  results  carry 

over  to  aQ  and  o( c0(a0)  )  ,  etc. 

More  specifically: 


(i) 


Over  the  Boolean  ring  u (aQ) 

-  '■(s’)  <A| 

O  Ob 

G  €  }  (X)  ,  is  well-defined 


'  <^/<so»a<0’  ’ 

from  properties  of 


*  s  9(G) 

"<V8  ,See 


Theorem  1).  Then,  using  the  finite  additivity  of  o  .  .  over 

o 

u(a  )  and  its  downward  continuity  at  0  (using  properties  of 
topology  f (aQ)  ,  v(s  )  ^nt*eed  a  probability  measure  over 


«(®  )  .  (Note  PgU.) 

J  J 

the  Aj's  are  disjoint 


. 


,  j  =  will  be  disjoint  iff  all 

collections  of  sets  and 


(ii)  Extend  w(s  )  up  to  o(CQ(ao))  by  the  usual  outer  measure 

procedure.  (See,  e.g.,  Halmos  [1041])  Then,  in  turn,  extend 
up  to  o(Cj(a  ))  ,  noting  the  regularity  property  associated 

with  a  (C  (a  )  )  .  (Again,  see  Halmos  [104'].)  However,  the 

last  property  may  be  improved  upon  by  using  the  structure  of 
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Let  A  e  °( ® j(*Q) )  be  arbitrary.  Then  by  regularity  of 


v(So)  over  o(Cl(*o))  ' 

from  ao  .  c(1)  £  c(2)  £ 


sequences  of  classes  of  sets 


£  A  £ 


S  "*(2)  S  A(l)  '  where 


But  A 


( j) 


C  ( j  J  «  ci(ao)  and  “*(j)  €  r(ao}  '  ^  =  i'2'---'  with 

1  ‘'<s0)‘e<i)>  -  ■'<»„><*>  -  £  *  "<S0)<*(J)>  • 
as  an  open  set  for  r(aQ)  is  in  the  form 

*n.  «*  u  p~ 1  ( { C  } )  ,  for  some  C  £  G  e  J  (X)  ,  J  an 

(  j  )  _  -  o  d  d  d  1 

'J'  aej^  a 


(j)  ' 


arbitrary  index  set.  Then  by  the  compactness  of  say  c 
3  finite  integer  m|  ,  say,  such  that 
d  -1 

C(3)  S  6(j)  =  U^G  ^Cj  ))  By  choosing  j  such  that 

lc=  1  j  u  It 


C( j)  S  A 


l'(so)(c(j))  5  W(So)M>  5  y{so)(c(j))  +  1/J  we 


finally  obtain 


- 1/3  *  2S 

w(so)(e(j))  +  1/j  5  jJJ 


\ (Pr 

<So>  Gj 


W(S  1lPG  "1({C 

(So}  GJk  Jk 


1({C.  )))}  - 

k  jk 

3  >’>)* 


1/J  *  u(so){A}  s 


i/3  . 


yielding,  without  loss  of  generality, 


m3 

v  .(a)  =  lira  i  v.  .(e,..)  =  lim  i  2  fi 
1  o'  j-®  [So’  (J)  j-.®  k=lV 


t'(S  )(PG 
o 


1<{C1  >))) 


with  u[a  ^  e(j))  <  and  -*  n  c(j)  =  c(j)  s  c(j)  '  for 

j  =  1,2,  .  .  . 

In  particular,  for  a  =  c  (a  )  ,  C  e  *(X)  ,  we  can  show 

v  O 


ra. 


(J)  JVt  k=l 


may  be  chosen,  etc. 


(ili)  Finally,  letting  (see  Halmos,  Theorem  D,  p.  239) 

x  (a  )  2  (a  |  a  e  a  &  3  a’  «  a  with  a  £  a*  , 

(SQ)  °  o  o 

u  {S  ~  °>  <  we  can  iet  o  { c  x  ( aQ ) )  =  °<c1(a0))  U 

*U  (®n)  *  0(ci*a,-J  -  *,J  (®  ))  ,  where  for  any 

(SQ)  °  °  V(SQ) 
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.a  6  o(c1(aQ))  and  a  «  Jfy  (aQ)  ,  wg  (a  u  a)  =  vs  (a)  . 

(SQ)  o  o 

Remark 

The  following  natural  relations  hold  for  any  a.  «  o(c1(aQ))  , 
C  «  a (X)  . 

v{s  }(*)  =  Pr(SQ  «  a)  =  Pr(  S'1  (ji  ) )  . 

U(S  )(CC{ao)}  =  Pr(So  6  cC(aoJ)  =  Pr<c  4  So>  =  ^(s  )(C)  * 

O  '  o' 

In  particular,  for  C  =  (x) ,  x  «  X  , 


W(SQ) {x)(*o)  J  =  ^(So)({x))  • 
W(S  )(W>  "  Pr(So  €  W}  =  Pr(S  4  C)  ' 
w(So)(eC(ao))  =  Pr(So  e  ec(ao!)  =  Pr<S  n  c  *  0)  - 
‘'<S0)(*X-tC(V>  =  Pr<So  6  eXHC(ao)  *  =  Pr(S  4  C>  ' 

So  =  <t»_1(X.<S0))  n  ao  , 

(4>(S0))(<x))  =  <ps  ({x})  =  VX{SQ)  ,  etc. 


Dempster  [46]  first  obtained  the  following  important  rela¬ 
tions  . 


Theorem  2  (Dempster's  Theorem  [46]  -  Justification  for  the  Use 
of  Terms  Upper  and  Lower  Probability  Measures  as 
Alternates  for  Plausibility  and  Belief  Measures.) 


Let  (n.g.Pr)  be  a  given  probability  space  with 
se  !  0  -*  a0«  {#.X}  £  aQ  s  a(X),  So  being  a  random  subset  of  X  , 

thus  inducing  probability  space  (aQ,  o(c  (a  )),  vg  )  . 

o 

Then,  for  any  f  :  n  -  X  such  that  f  is  { g ,9 ) -measurable , 
where  a  £  *o  is  a  o-algebra,  and  f(w)  e  So(u),  v  u  «  n  ,  the 

induced  probability  measure  under  f  ,  Pr°f-1  over  a  satisfies 
the  inequalities 


V  ...(C)  s  Pr{0  *  £  C)  5  Pro  f-1 ( C )  s  Pr (S 

(S  ) °  ° 

o 


<v 


(2) 


(C) 


n  c  *  0) 


for  all  C  «  a  . 
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Proof : 

Straightforward,  see  Dempster  [46]  for  further  details  and  re¬ 
lated  results. 

Note  that  for  C  =  {x} ,  x  e  X  ,  Dempster's  inequality  becomes 
Pr(SQ  =  {x})  £  Pr»f-1(x)  s  Pr(x  «  Sq)  . 


Theorem  3  (Uncertainty  Measures  as  Random  Set  Coverage  Functions) 
Let  X  be  a  given  base  space  with  {0,X)  £  aQ  £  »(X)  .  Then 

(i)  If  is  a  doubt  measure  over  J(X)  ,  then  p  can  be  extended 
uniquely  to  »(X)  such  that  there  exists  a  (unique)  complete  - 
(finitely)  regular  probability  space,  i.e., 

(»  ,  o  (c  (a  )),  v  )  corresponding  to  random  subset  S,,  of  X 

O  U  1  O  O  U 

P 

with  range  ®q  such  that  for  all  C  e  »(X) 

P(C)  =  Pr(Sfj  e  cc(aQ))  =  vs  (cc(aQ)) 

(=  Pr(C  s  S^) )  . 

(ii)  If  fj  is  a  plausibility  measure  over  1(X)  ,  then  /v  can  be 
uniquely  extended  to  a(X)  such  that  there  exists  a  (unique) 
complete  -  (finitely)  regular  probability  space,  i.e., 

corresponding  to  random  subset 
such  that  for  all  C  e  »(X) 

P(C)  =  Pr  (  C  •  si_^  «  cc[*o)] 


(=  Pr(C  n  CS^  *  0)  )  . 

(iii)  If  p  is  a  belief  measure  over  »(X)  H  } (X)  then  v  can  be 
uniquely  extended  to  »{X)  such  that  there  exists  a  (unique) 
complete  (finitely)  regular  probability  space 

(a  ,  o  .  (C  (a  ) )  ,  v  (£.•))  corresponding  to  random  subset 

°  PIC-)  1  O  r‘  ( JC  ■  ) 

-*Sp(C*)  of  X  with  range  aQ  such  that  for  all  C  €  a(X)  , 


<V  0H(cl(9on'  "s 


l-p 


(C-  )  ) 


C-Sj  ^  of  X  with  range  aQ 


P(C) 


pr(i's„(C-)  *  V*o" 

(-  ,  1-S^.  ,  =  C)|  . 


(iv)  If  p  is  a  disbelief  measure  over  *(X)  ±  J (X)  then  p  can 
be  extended  uniquely  to  a(X)  such  that  there  exists  a  (unique) 
complete  (finitely)  regular  probability  space 

(aQ,  °  ^  c ,  j  j  (aQ) )  »  c.  )(£*))  corresponding  to  random  sub- 
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set  CS,  . „  .  of  X  with  range  a  such  that  for  all 
—  1  ~am£*  )  o 

C  €  »(X), 

"(c)  -  €  *cc(ao)  5 

=  1  "  I,l-/i(C->(i'Vao>> 

(-  Pr(CC  n  c  S1_A((C.  }  *  0))  . 


Proof : 

First  consider  J  (X)  and  »(X)  h  J(X)  ,  where  appropriate. 
For  (i),  since  a*  is  a  doubt  measure,  we  can  replace  in  part  (I)  of 
the  construction  in  Theorem  1,  .  by  p  .  Parts  (II)  and  (III) 

o 

follow,  (ii)  -  (iv)  follow  by  using  Lemma  4,  Chapter  3  in  conjunc¬ 
tion  with  part  (i)  above. 

Finally,  consider  again  (i)  for  the  unique  extension  from  } (X) 
to  *(X)  .  Cases  (ii)  -  (iv)  will  follow  similarly.  The  finite 

regularity  property  for  SQ  ,  shown  in  III  (ii)  of  Theorem  1  and  the 

result  shown  first  -  /i(C)  =  (Cc)  _,v  C  e  1  (X)  ,  demonstrate  that 

o 

for  C  «  *(X)  arbitrary,  we  should  define 


A*  ( C )  =  11m  i  )  ' 

j-.«  k-1 

the  definition  not  depending  on  the  particular  convergence  sequence, 
etc.  (See  also  related  results  of  HOhle  [116]  and  Goodman  [90].) 


Theorem  4  (Natural  Imbedding  of  a  Random  Variable  as  a  Random  Set.) 

Let  fj  be  a  finitely  additive  probability  measure  over  the 
Boolean  ring  J(X)  0  (X  ±  J (X) )  .  Then  p  can  be  uniquely  extended 
to  *(X)  such  that  there  exists  a  unique  (complete  finitely  regular) 
probability  space  (9 (X) ,99 (X) ,v ^  ^ )  corresponding  to  random  subset 

S  of  X  such  that  for  all  C  «  ?(X)  , 

(/O 


P(C)  =  Pr{0  *  S,,  S 
(AO 

More  specifically  S 

(") 

bility  subspace  (X,  »(X), 
C  £  ((x)  |  x  e  C)  S  -MX)  , 

V  .  .(A)  =  V  .  ( A 

(aO  (a 0 


C)  =  Pr(S  n  c  *  0)  . 

(AO 

can  be  shown  to  correspond  to 

y(p))  •  w^ere  for  any  C  «  9 (X)  , 
and  where,  for  any  A  e  99 ( X)  , 

n  X)  =  fj(u[A  n  x)  )  . 


proba- 


In  particular,  for  any  C 
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v 


(F> 


(CC(»(X))) 


f  0  iff  card  C  2  ,  C  «  »{X) 

l  p(  {x} )  iff  C  =  {x}  ,  x  €  X  . 


(Proof  •• 


From  Theorem  4,  Chapter 
belief  measure.  Then  S,, 

(A* ) 


3,  p  is  both  a  plausibility  and 
satisfies  these  conditions;  then  use 


uniqueness  of  constructions  as  in  previous  theorems.) 


Theorem  5  (Converse  to  Theorem  3:)  (Random  Set  Coverage  Functions 
as  Uncertainty  Measures) 

Let  X  be  a  given  base  space,  with  {0,X  )  £  »Q  £  a(X)  and 
some  random  subset  of  X  with  range  corresponding  to 

probability  space  (a  ,  o(e  (a  )),  v  )  .  Then  as  functions  of 

O  1  O  o 

O 

C  €  *(X)  , 


V 

V 

V 


(So)(C)  -  =  Pr(C  s  V 

sjw*  =°Pr(so  s  c)  ' 

«  P*(S0  n  C  *  0)  , 
o 


,(a  )  )  =  Pr ( S  *  C)  ; 


the  first  is  a  doubt  measure,  the  second  is  a  belief  measure,  the 
third  a  plausibility  measure,  and  the  fourth  is  a  disbelief  measure. 


Proof : 

Let  Bn  =  (B1(...,Bn)  ,  Bj  «  ?(X),  j  =  1 ,  .  .  .  ,n  .  For  the 

first,  use 


n-1 

o  i  u.  (c_  (a  )  h  u  c 


S  '~B  '  O- 
c  n 


B  .UB 


(»«)) 


j  4  M  jWW 

J“1  j  n 


n-1 


1  -  *3  <»o  H  CB  <V>  "  Vs  <  U,  VuB  <V>  • 

o  n  o  j=l  j  n 


Expand  the  last  union  in  terms  of  intersections.  Then  use  Lemma  4, 
Chapter  3.  For  example,  ws  (ec(aQ))  is  a  plausibility  measure 

o 

w.r.t.  C  iff  1  -  (c^ (»Q )  )  *  ycs  (cc(aQ))  is  a  d°ubt  measure, 

o  —  o 

which  follows  from  the  first  part  of  the  proof,  replacing  SQ  by 
£  So  • 
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Remark . 

Many  additional  results  relating  uncertainty  measures  with 
random  sets  may  be  obtained.  For  example,  Dempster's  combination  of 
evidence  operator  ®  (see  [282],  [228])  can  be  reinterpreted  in 

terms  of  statistically  independent  random  sets,  each  representing  a 
doubt  measure.  Specifically,  we  can  let 

Bel1(C)  =  Pr{0  *  Sj  £  C)  ,  j  =  1,2  ,  C  £  X  ,  Sj,  Sj 
being  statistically  independent  random  subsets  of  X  .  Then 

(Belj  e  Bel2)(C)  =  Pr(0  *  Sj  n  s2  £  C  |  n  S2  *  0)  . 
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Pr{si.1)  «  Jl  n  G.) 
G  j  “  J 


Note  also  the  regularity  approximations  from  Theorem  1 


Proof: 


First  use  a  form  of  Lusin's  Theorem  (Halmos  [1041],  pp.  242, 
243)  to  obtain  A^  s  ci^a0<  t'(s  )  H  -*j)  *  1/j  and  (f|A.j) 

continuous.  Then  consider  3  {go  (pQ  |ji ^  )  |  G  6  J  (X)  , 
g  :  »(G)  -*  R)  .  Each  g,  ( pQ I A -j )  and  hence  g°(PGl-*j)  e  is 


continuous.  For  *  =  t  or 


'  <fl1°PG1)*(®20PG2)  = 

.  ,  implying  that  X. 


((gl°PG  ^*(g2°pG  5  )opG  UG  €  Kj  '  lmP1Yin9  that  Xj  is  ar*  algebra 
of  functions  including  all  constant  functions  AoPG  s  A  ,  a  e  r  . 


G1'G2  €  be  such  that 


*  G2 ,  0 


Then 


PG  HG  <<G2})  =  0  and  pQ  HG  <Gi)  *  Gi  g2  *  0  ■  since  T(aQ)  is 
compact  Hausdorff,  it  is  normal  and  Tj  ,  the  latter  implying  {0}  , 
(G1  H  G2)  are  closed  sets.  Urysohn's  Lemma  then  implies  3  g 
continuous  g  :  -*  [0.1]  such  that  g(0)  =*  0  ,  g(Gj  H  G2)  =  l  . 

Hence  (SoPq^q^)  (  (Gj)  )  =  0  ,  (gopQ  iHg2H<gi))  "  1  ,  and  thus  H ^ 

is  a  separating  algebra.  Hence  by  the  Stone-Weierstrass  Theorem,  3 
Gj  «  J(X)  ,  g^  :  »(Gj)  -*  R  such  that 

sup  | f ( C)  -  (g^opp  ) { C ) |  <  1/j  .  Then  for  C  e  a  , 


|  E  (  f  ( S  )  )  -  E(  (fop  )  ( S  )  )  | 


C«S  -U. 
o  j 


( |f(C) I  +  |f(C  n  Cj) I )dy(s  } (C) 


+  J  If (C)  -  (gj°PG  ) (C) |dw(S  } (C) 

a  J  ® 


+  J  l(f°PG  )(C)  -  (gj°PG  )(C)|dy(s  J ( C ) 
3  3 

s  ( 1 / j ) ■  ( Mf  +  Mf )  +  (l/j)-l  +  ( 1/J ) *  1 


kemari . 

The  degree  of  information  lost  by  SQ  approximating  f  may  be 

measured  by  the  cross  entropy,  assuming  X  is  finite,  letting  V  = 
id  correspond  to  (n,x,Pr)  .where 
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Ent ( f { V) ,SQ) 


£  -  Pr(f-1(x)  |  S”1  (B )  )«log(Pr  ( f -i  (x)  |  SQ1  (B) ) )  , 


,-l 


-1 , 


.-1, 


fall  x«X 
I  B«a_ 


subject  to  the  constraint 

SQ  «  *f(X)  =  {SQ  |  SQ  «  a(X)  &  f{»)  e  sq(u)  .all  w  «  n}  > 

where  st(X)  is  the  class  of  all  random  subsets  of  X  . 

Then  clearly  for  any  given  f  :  ft  -»  X  as  above 


sup  Ent ( f ( V) , S  )  =  Ent ( f { V) ) 
Sq  e  *f(X) 


^  -  Pr(f_1(x)  )<log  Pr ( f_1  (x) )  . 
x«X 


occurring  for  Sq  =  X  ,  and 

inf  Ent ( f ( V) ,S  )  =  0  , 

SQ  ^  af(X)  ° 

occurring  for  SQ  =  {f{V)}  ,  i.e.,  SQ(u)  s  (f(w)}  . 


Remarks .  Marginal  and  Joint  Random  Sets  and  Functions. 

Let  (n,g,Pr)  be  a  given  probability  space  with  S£ 
a  random  subset  of  X  with  range  £  *{X)  . 

Often,  in  practice,  there  is  some  index  set 
n  0  *  aj  c  ,(Xj)  '  3  €  J0  '  with 


ft  -*  a 


JQ  such  that 


ga®  g  «  o  ( {  xx  x  x» 

l€j  ^  J  J 

J  O  j«Jj 


Xj  €  V  j  6  J1  S  Jo 


arb.  finite} )  , 


where  for  each  j  «  ,  g^  2  c1(»j)  or  2  r^aj)  •  can  be  shown 

or  c , 


that  for  «  g  ,  we  can  replace  x.  by  c 

j«j  J  txj > 

'W*/' Jor 


or 


j  «  Xj.  C  j ,  B  j  6  }  (Xj )  arb.,  J  s  Jj 


In 


addition,  we  may  take  *>  proj^  »  wher®  for  any 

B  =  (Bj)j€J  e  ft  ,  Bj  «  ,  j  e  ,  projk(B)  =  B^ ,  for  some  fixed 

k  €  JQ  •  (In  particular,  if  Jq  Is  a  singleton,  say  {0}  ,  then 
ft  =  »o  and  SQ  =  id  over  (ft,x,Pr).}  We  say  that  S^,  k  e  Jq  , 
is  a  marginal  random  subset  of  Xj  corresponding  to  probability 

space  (aic'?ic'Prs  >  <  Prs  =  Proproj"1  ,  and  <sk)k€j  is  a  J0~J°‘nt 
k  k  o 

collection  or  random  subsets  of  each  Xj ,  j  «  JQ  . 

Other  relations  between  random  subsets  are  required.  For 

example,  if  J  «  (1,2),  X,  »  X.  =  X  and  V  c  e  »(X),  c„  «  a  , 
o  1  i  Co 
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*  a2  =  aQ)  and  the  conditional  probability  measure  here  is  well 
defined,  then 

Pr(Sl  £  S2)  =  *Si(Pr(Sa|Si)(3l  =  S,  |  Sl)  ,  etc. 


Let  S  *  (S.).  _  beaJ  -joint  family  of  random  subsets  cor- 
J  J€Jq  O 

responding  to  probability  space  (  x  a^.g.Pr)  as  above.  Let 


*  :  x  a(X.)  -»  *(X  )  be  a  given  ordinary  set  operation  such  that  * 

j«J0 

Is  (  •  g  .  ,C rng( *)  immeasurable  for  all  x  «  X  .  Then  *®S  is  a 

j€j  J  lX)  ° 

J  o 

random  subset  of  XQ  ,  since  functional  inverses  preserve  all  ordi¬ 
nary  set  operations,  noting  X  -i  c.  .  =  a  .  ,  etc. 

\  X  /  x-f  \  X  / 


,  -1  , 


(*®S)  *(r( rng(*)))  -  S  1(*  ^(rngd))  £  S  A(  *  g  )  =  ®  g  . 

j€Jo  J  j€Jo  J 

Hence  (*»S)  1  (o  (r  (rng(  * )  ) )  )  £  ®  5.  .  *°S  corresponds  to  pro- 

3*J0  3 

bability  space  (rng(*),  o (r ( rng( * ) ) ) ,  Pr»*-1)  . 


-1 , 


Example  1.  Let  f  :  Xj  •*  X  be  arbitrary.  Let  Sj  be  a  ran¬ 
dom  subset  of  Xj  ,  corresponding  to  probability  space  (ai»Si»prs  )  , 

where  £  »(X)  ,  g^^  2  v(&^)  is  a  o-algebra  over  .  Consider 

then  the  induced  set  function  f  :  »(X1)  -»  *(Xq)  .  Then  for  any 

x  «  X  ,  f-1(c  (rng(f)))  n  a  =  c  (a,  )  .  Thus,  if  either  f 

0  [x>  1  f-1(x)  1 

is  such  that  f  1  is  at  most  countably  infinitely  valued  or 
c  (a  )  «  g  2  a  g  ,  g  a  o-algebra,  then  f(S,  )  will  be  a  random 
J«o  3 

subset  of  X 

o 


Example  2.  Consider  spaces  x  »(X.)  and  *(X  )  ,  with  joint 

J€jo 

random  sets  S  corresponding  to  probability  space  (  x  ».,g,Pr) 

jej  3 
J  o 

as  above.  Let  *  :  x  *(X.)  -♦  *(X  )  as  before.  Form  the  (compact 

j«Jo  J 

Hausdorff,  etc.)  topologies  3"(Bj),  j  «  JQ  and  ;r(rng(»))  .  Form 

the  natural  product  topology 
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•  9'<aj)  =  (  *  -A  j  X  X  I  6  ff1'  3  €  J1  S  Jo' 

j®Jo  3  l  3«Jj  3  J«VJ1 


Jj  arb. 


which  is  also  compact  Hausdorff,  etc.  Then  if  *  is  (  ®  7 (a.), 

jej  3 
J  o 

7 ( rng{ *))) -continuous ,  then  it  follows  immediately  that  *  is 
(9,  o  (r  ( rng(  *))))  -measurable  and  hence  *j3  is  a  random  subset  of 

Xo  • 

In  particular,  if  *  is  a  composition  operation,  i.e., 

J. 

3  <1),  :  (0,1)  -*  {0,1}  such  that  for  any  e  7{X^),  j  ,e  Jq  ,  let¬ 


ting  c  §  (cj)j-Jo.  <DS  9 

♦  (•<£>>  -  4>,0$c  . 


then  *  is  also  continuous  in  the  above  sense.  Thus,  n  ,  u  ,  h  , 
A  ,  etc.,  are  all  continuous  and  hence  measurable  operations  yield¬ 
ing  in  turn,  n  S  ,  ’J  S  ,  X  -«  S  (when  JQ  =  {0})  ,  all  legitimate 

random  sets. 

Specifically,  note  for  £  ?(X.j)  ^Xj  3  xi ) 

n  1c.  ,{rng(n))  =  x  c  (a.)  e  a  g.  ,  and  for  J  countably 

<x)  j«Jo  <x)  3  3  ° 

infinite  or  finite 

u_1c ,v> (rng(u) )  =  u  (C.  >(».)  x  x  a  )  e  ®  g 

1  '  j«Jo  *x'  3  kejH{j}  K  j€jQ  3 

C_1(C{x){rng(i)))  "  C(c{x}(rngi))  =  *XH {x} ( ™g±>  c  gQ  . 

All  operations  on  random  sets  here  can  be  shown  to  be  at  least 
measurable,  if  not  continuous. 


Appendix 

The  Natural  Topologies  and  Sigma  Algebras  for  the  Power  Class  and 
Membership  Function  Space  of  a  Given  Base  Space. 

Let  X  be  a  given  base  space,  {S3 ,9  (X) )  £  *o  £  a(X)  fixed  and 
recall  the  definitions  for  cc(»o)  ,  «sc(»o)  and  ec{®0)  -  for  any 

C  «  »(X)  ,  and  for  ?  ,  for  any  x  =  a  =  <aj)j€Jo  ' 

Xj  €  X  '  aj  €  {0«1)  >  J  e  JQ .  as  well  as  for  <f>  the  fundamental 

membership  mapping.  For  each  x  €  x  ,  the  set  {0,1  }x  can  be 
identified  with  {0,1}  and  is  assigned  the  natural  discrete  topo¬ 
logy  7x  -  7  =  »{0,1}  =  (0,  {0}  ,  {1}  ,  {0,1}}  ,  i.e.,  all  subsets  of 
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{0,1}  are  considered  open  relative  to  .  It  is  trivially  ver¬ 

ified  that  indeed  y  is  a  totally  disconnected  compact  Hausdorff 

space.  Hence  the  cartesian  product  topology  (or  topology  of  point- 
wise  convergence),  the  smallest  topology  making  all 

proj^  :  {0,1}X  -*  {0,l)x,  x  e  X  ,  continuous,  is 

*r  1  ( X )  3  top(  {proj”1  (y )  |  x  «  X})  =  {pro  j”1  (y )  |  x  e  x}1^ 

=  {{Ov  o  (X)  |  for  all  x,  a,  J  finite}  u  {{0,1}X}  u  {0} )U 
x  T  a  T  ^  o 

o'~  Jo 

«  ({0X  fl(X)  |  Xj  *  X,  J  ■  1. - d  :  m  k  1}  u  {0y  0  |  y.  e  X, 

j 

j  =  1 . n  ;  n  k  1}  u  {{0,1}X}  u  {0})^, 

y 

y‘(X)  is  the  natural  topology  for  {0,1}  .  As  a  product  topo¬ 

logy,  it  follows  that  y*(X)  is  also  a  totally  disconnected  compact 

v 

Hausdorff  topology  over  {0,1} 

Now  recalling  that  is  a  one-to-one  onto  mapping, 

r(aQ)  3  <}>-1  («r 1  (X) )  Q  aQ  is  a  disconnected  compact  Hausdorff  topo¬ 
logy  over  and  is  called  the  natural  topology  (or  the  power 

class  on  X  relative  to  aQ) .  Note  the  correspondences  for  any 

V  *2 . *»•  Yl'  •••'  Y„  ‘  X 

<WV  *  "  *0  ■ 

av .  /  ,  (»  )  =  $_1  (°  (X) )  n  a  , 

X-t  { x }  o  T  '  x ,  o '  '  '  o 

m  -1 

c,  ,  (a  )  =  n  c,  ,(a  )  =  *  (o  ,)  n  a 

{Xj . xm)v  o'  {x,}'  °  ~'i  ° 


X-MYj . Yn)  o 


(a  )  =  n  a 


X^  {x  , }  '  o 


(a  )  =  <t>  (o  )  n  a 

'  r\  '  i  '  r\  '  r 


e  ,  .  (a  )  =  a  -i  a„  , ,  . 

{yl"  '  •  '  yn  °  °  X-MYj - »Yn> 


=  u  {<p  A(o  )  |  a  s  o)  , 
x  *5 


since  C  o„  ,  =  o  .  .  This  also  implies  that  all  0  ,  J 

X.l  X ,  0  ^  Xj  ,Sj  o 

o  o 

finite,  are  clopen  (closed  and  open)  and  hence  also  compact  rela¬ 
tive  to  y'(X)  ;  hence  all  $  1(0  )  n  a  ,  for  J  finite,  are 

X  —  cl  _  O  O 

**  J  *  J 

o  o 


clopen  compact, 
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Note  also, 


4>  *(0 


xj  *~J 
o  o 


<*>»  "  *o  ■  c(x . x_><V  n  »*«y . y  ><V 


where 

)*5j  )  - 
o  o 


. »■  U'j-l . •>  0  . n*  «01J-1 . »'  ; 


»  -*  <*  1<°v  a  (X))  n  a  )  -  u  $“ 

°  5j  'Sj  ° 

1  (0  , 

2j  'Sj 

1  "V 

o 

o  o 

o  o 

m 

ra 

=  U 

aXH{X;J}(ao) 

u  n 

c 

j-1 

3=1 

yj 

Thus,  ir.  summary,  for  any  C  c  j  (X)  ,  Cc(a0)  .  ^x-iC^o*  • 

e  (a  )  and  all  finite  unions  intersections  and  complements 

c  o 

(relative  to  aQ)  are  clopen-compact  for  r(aQ)  • 

As  also  a  T^space  (and  compact)  ,  it  follows  that  any  finite 
subclass  of  aQ  is  closed  (and  compact)  relative  to  r(aQ)  • 

Furthermore,  for  any  C  «  *(X)  ,  since  cc(aQ)  =  n  C{X)(a0>  < 

xeao 

c_(a  )  is  closed  and  thus  compact  relative  to  r(a  )  •  Similar- 

Co  o 

ly,  a  (a  )  is  closed-compact  relative  to  r(a  )  <  while  e  (a  ) 

CO  O  w  o 

is  open  relative  to  r(aQ)  .  for  any  C  «  »(X)  . 

An  alternative  equivalent  approach  is  through  projections:  For 
any  C  «  J (X)  ,  define  the  general  projection  mapping 

p_  :  a(X)  -*»(C)  ,  where  for  any  D  e  a(X)  ,  p~(D)  =  D  n  C  .  Thus, 

c  c 

for  any  C  £  G  «  } (X)  , 

pencil  n  »o  »  ((€}  o  ?  { XH  G ) }  n  , 

■  W  n  *XH«HC1<V  ■ 

Pg1(<*>)  n  ao  =  aXHG(V  :  n  ao  =  VV  =  ao 

Pg1({G,)  n  *o  =  W  *  P01({G>  n  V  “  W  =  ao 
Pq1(0)  n  *o  =  0  ' 


Now  for  any  G  «  } (X)  , 

p^  (99  {G)  )  Q  aQ  =  { p~ 1  ( )  n  a^  |  a  :  a(G)}  is  a  o-algebra  (of 
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clopen  compact  sets  relative  to  f(ao)}  over  aQ  ,  where 

p~ 1  (-«. )  =  U  p'^C)  :  For  any  a  £  »(G)  ,  j  =  1,2,..., 

G  CeA  b  J 


3£<Pg1|  V  n  V  *  n  »o  ■ 

Tlp-'Mj)  n  »0)  -  p-‘(  n  »e  . 

®o  ■*  (P-1^)  n  ao)  =  p"1!*^)  i  a^  n  aQ  .  (*) 

Furthermore,  ■»{*  )  ^  u  (pI1(»»(G))  Q  a  )  is  a  Boolean  ring 
°  G«9-(X) 

over  aQ  ,  since  for  any  a^  £  »(Gj)  '  Gj  €  9"  (X)  ,  j  =  l,...,m  , 


m  -1  -1 
u  (pG  (•*,)  n  a  )  =  p  f 

3-1  J 


A  (  u  u ,  u  a  (  u  g  .  -t  G.) )  n  a  ) , 

m  i=1  J  “  £-i  4  J  ° 

u  G.  J 

l3=i 


111  -i  -l 

n  (Pr;  (A.)  n  a  )  -  P  , 

3=1  3  J 


(  "  G3 

Lj=l  3 


(  n  (a.  n  a  (  u  g.  h  G  . )  )  n  a  )  , 
j=l  J  t=l  J 


(and  equation  (*}  still  remaining  valid). 

It  is  not  difficult  to  see  that 

r(»Q)  =  top  (»<ao))  =  (i»(ao)),:|U=  «(*Q))U 
=  top  (<e(x)(»0)  I  x  «  X)  u  (aXH{x}(ao)  |  x  e  X)  u  <*o)  u  (0) ) 
=  top  (<cc(ao)  |  C  €  j  ( x ) }  u  {a>x_,c(*0)  |  C  e  MX))  u  (0)) 

=  <W  n  *X-<(G-.C)(V  I  C  s  G  e  ’<*>>"  ' 
noting  that  for  any  «  a(X)  ,  3  €  J0,  arbitrary  index  set, 


n  c  (a  )  =  C 
3«JQ  °3  ° 


f  uc,l(ao>  '  "  ac  '%>  =  V  <%>  • 

l3^jJ  j€Jo  j  3-J^ 


For  any  A  ^  £  ?(G^)  ,  G  ^  €  }  (X)  ,  j  =  1,2.  p"1^)  =  Pg1  (a2  ) 
iff  G,  £  G„  and  a„  =  p_1(A  )  n  (G„)  ,  or  vice  versa  with 

12  2  (j  j.  1  2 

G2  £  Gj  ,  etc.  In  particular,  note 
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Pg1({Gl))  =  p"^((C  |  Gx  £  C  £  G2})  . 

First  consider  the  Baire  o-algebra  (Halmos,  Theorem  C,  p.  221) 


o({C(x)(V  I  *  e  X)  u  (*Q))  =  o(*(»0))  =  ° (c0(®0) )  =  o(Mo(ao))  ' 

where 

c  (a  )  =  (ji  I  ji  £  s  &  -A  is  compact  G.  (at  most  countable 
o  o  o  o 

intersection  of  open  sets  for  r(»o))}  , 


v 

o 


<v 


=  <3(C0(*0) )  n  r(®0) 


In  turn,  the  Baire  o-algebra  is  contained  within  the  Borel  o- 
algebra : 


where 


o («(»0) )  S  a (C1 (»Q) ) 


o(r(aQ))  , 


c  (a  )  ^  {ji  |  a  s  &  a  is  compact  (equivalently  closed) 
relative  to  r(aQ))  , 

Clearly,  for  any  C  €  »(X),  ec(*o) ,  ®c(®0)  »  ec(»0>  €  °(»(*0)) 
(since  already,  the  first  two  classes  are  closed  compact  for  r(»Q)  ; 
while  the  last  class  is  the  complement  of  a  closed-compact  set  for 

r(sQ))  • 

Finally,  using  <}>  ,  note  the  analagous  Baire  and  Borel 

x  X 

o-algebras  with  respect  to  x  (0,1)  =  (0,1)  are 

X€X 


o ( (o  (X)  |  x.  €  X  ,  j  =  l,...,m  ,  mi  1)  U  { (0, 1>X> ) 

X  j  ,  1  J 

=  o ( (o  (X)  I  for  all  x,a,  J  finite)  U  {{0,1)X}) 

Xj  ,aj  ~  -  o 

o  o 

=  o(c‘o(X))  =  «Mo  ( Co(»(X))))  t 

v  Here  c'o(X)  is  the  set  of  ail  compact  (equivalently  closed)  Ga 

subsets  of  (0,1)X  relative  to  7’(X)  ,  # 

o  ( c  *  ( X )  )  =  a  {7  '  (X)  )  =  <J>  (o  (C  1  (a  (  X)  )  )  )  (Borel  o-algebra),  where  C^X) 

X 

=  class  of  all  compact  (equivalently  closed)  subsets  of  (0,1) 
relative  to  7  '  (X)  . 

See  also  Michael  [176]  for  basic  background.  For  further  de¬ 
velopment  of  the  topologies  of  collections  of  subsets,  see  also 
Badard  [9'],  where  also  applications  to  uniform  structures  for  flou 
classes  are  given,  which  in  turn  are  used  in  fuzzy  analysis  and 
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extensions  of  fixed  point  theorems. 


4.3  Coveraqe/Incidence  functions  and  Choquet's  theorem  on 
capacities . 

In  Chapter  3,  an  alternative  approach  to  uncertainty  measures 
was  developed  with  enphasis  on  the  finite  case,  although  extended 
to  more  general  situations.  In  this  section,  Choquet's  Capacity 
Theorem  is  used. 

As  in  4.1,  let  X  be  an  arbitrary  set,  and  Tq  be  the  space 

law  of  a  random  set  S  taking  values  in  (X) ,o (a ) )  .  Note  that 
Tq  :  J  (X)  -♦  [0,1]  ,  Tq(I)  =  Pr(S  n  i  *  0)  and  Tq  is  characterized 

as  follows : 


Theorem  7  (Matheron  [171]) 

If  T'  :  J(X)  -*  [0,1]  is  a  given  functional,  define 
P’  :  }(X)  x  }  (X)  -  [0,1]  by  P  ‘  (0, 1  *  )  =  l  -  T '  ( I '  )  ,  and 

recursively 

P'(I  u  {x},I')  =  P'd.I')  -  P'd.I*  u  (x))  , 

then  T'  is  a  space  law,  i.e.,  T*  determines  a  unique  probability 
measure  on  (9(X),o(9))  iff 

(i)  T' (0)  =  0 

(ii)  P'  i  0  . 


Note  that  Matheron' s  theorem  is  a  strengthening  of  Theorems  3 
(ii)  and  5  of  section  4.2. 

Now,  such  a  space  Jaw  To  defines  a  sequence  r  =  (ip  , n  i  1) 

of  functions  as  follows: 

For  I  =  (Xj,  x2,  ....  xn)  «  }(X)  , 

¥’n(xl,x2 . xn)  =  To<I)  ~  Pr(S  n  (x2 . xn)  *  0)  . 

In  a  dual  way,  we  define  n-polnt  probability  coverage  function 
of  a  random  set  S  as  follows: 

For  nil,  «x2  '  •  ’  '  ,xn!  =  Pr(S  2  (xi  ,x2  '  •  •  •  'xn> )  -  we 


have 


( 1 )  For  each  nil,#  :  X 

n 


[0,1]  . 


(ii)  For  each  nil,#  is  symmetric. 

(iii)  (#n>n  2  is  ^ecreasin9  in  the  following  sense: 


Vn  M,VjS  n.  #n(xr 


’xn>  *  *j(xr 


’Xj) 


(iv)  If  x ,=...=x  ,  m  £  n  ,  then 

l  m 
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VX1 


'  xm ' xm+ 1 1 


'V  =  *n-m+l(xl'xm+l‘ 


’V  • 


(v)  For  each  nil, 

Z  (-1)K+1*  {x.  ,x.  - iX,  )  «  [0,1]  , 

K  ai  ak 

where  the  summation  is  taken  over  all  integers  i^i,,...,^ 
(1  s  k  s  n)  such  that  M  ij  <  ij  <  ...  <  i^  s  n  .  This  can 
be  seen  by  observing  that  (yn>  n  i  1)  and  (#n>  n  ^  1 )  are 
dually  related  as  follows: 


(a)  VX1 . xn)  =  2  ^-1)k+1#k(xi1'xi2 . 

(b)  *n(x1 - ,xn)  =  ]>  (-l)k  lyk<xii'xl2' • • • 'xik> 


Indeed,  since 


n 


(S  n  {x1(...,x  )  *  0)  =  u  (x,  e  S)  , 

i=l 

n 

a  n  (X,  €  s)  , 

i=l  1 


(S  2  [Xj . Xn>) 


therefore,  by  Poincare's  expansion 

n 

if’n(x1 . xn)  =  Pr[^u^(Xl  e  s >  ]  = 

\  (-l)k+1Pr[x.  e  S)  n...n(x  e  s)] 
al  *k 

-  5  . xi, 1  • 


Examples . 

1.  Let  f  :  X  -♦  [0,1]  be  given. 

Define  #1(x1)  =  f(x1)  ,  and 

n 

VX1 . V  =  i21f(Xi)  • 

2.  Let  F  be  a  distribution  function  of  a  real-valued  random 
variable  Y  .  Then  the  random  set  S  =  [Y,+<*>)  has 

VX1 . V  =  F(121xi)  • 


Remark . 


For 


f  :  X  ■*  [0,1]  ,  the  sequence  of  functions 
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n 

fn^xl  ’  ‘  ‘  ,xn^  =  x  *s  similar  to  the  one  used  in  symmetric 

statistics  of  infinite  order  (Dynkin  and  Mandelbaum,  [53]).  By 
considering  symmetric  functions  as  functions  defined  on  sets,  we 
define  #  ( (Xj , . . . ,xn> )  =  7f(f(x))  where  the  product  IT  is  taken 

over  x  «  {Xj , . . . ,xn>  . 

The  following  form  of  Choquet ' s  theorem  on  capacities  (see 
Matheron,  [171])  is  useful  for  establishing  connections  between 
random  sets  and  uncertainty  measures.  (See,  e.g.,  Goodman  [90], 
Shafer  [228],  Hohle  [116]). 

Let  X  be  a  locally  compact  and  separable  space.  Denote  by  9  , 
*  the  class  of  closed,  compact  sets  of  X  ,  respectively.  Denote 
by  »(sf)  the  Borel  o-algebra  of  the  topological  space  9  . 


Theorem  8  (Choquet  [36]) 

Let  T1  be  a  function  defined  on  X  .  Then  there  exists  a 
unique  probability  measure  Pr  on  ($,»(?))  satisfying 

Pr(F  €3|FnK*0)=  T(K) ,  V  K  e  %  , 

iff  T'  is  a  u-alternating  Choquet  capacity  of  infinite  order  such 
that  OS  T1  S  1  and  Tt#)  =  0  . 


Remark. 

For  the  use  of  Choquet1 s  theorem,  and  Matheron 's  theorem 
(Matheron  [171],  p.  41)  in  establishing  a  connection  between  random 
sets  and  possibility  distributions,  see,  e.g.,  Nguyen  [187].  Note 
also  that  the  above  theory  essentially  may  be  thought  of  as  the 
plausibility  measure  analogue  of  Theorem  2  or  Theorem  3  of  section 
3.2,  or  as  a  form  of  Theorem  3  of  section  4.2. 


4 . 4  Multi-valued  mappings  and  integration. 

The  concept  of  expectation  of  a  random  set  S  can  be  formally 
defined  if  the  base  space  X  possesses  an  appropriate  topological 

structure.  For  example,  when  X  =  Ra  ,  one  defines: 

A  +  B  =  (x+y  |  xe  a,  ye  a>  ,  A,B  £  Rd  . 
aA={ax|x«A)  ,  a  >  0,A  £  Rd  . 


More  generally,  if  S  takes  values  in  a  separable  Banach  space  X 
with  norm  n  •  n  ,  and  Borel  o-algebra  a  ,  then  first  the  random  set 
S  will  be  called  simple  if  the  set  of  possible  values  of  S  is  at 


most  countable. 


Let  random  set  S  =  z  B.<J>  with  iisn 

j  =  l  3  j 


00 


Pr-integrable ,  so  that 


Z  Pr(A,)B.  converges  absolutely,  where 
J  =  1  2  3 


#  z  Pr  ( A .  )  B  . 
j  =  l  3  J 


Bj  e  a  and  Aj  e  x  ,  for  probability  space  (fl,A,Pr) 
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is  called  the  Bochn&r  Integral  of  s  ,  noting  $A  is  the  indicator 

j 

function  of  A  ^  .  For  a  general  random  set  S  ,  it  can  be  shown 

that  ii  SH  is  Pr-integrable  iff  S  is  Bochner  integrable,  i.e., 
there  exists  a  sequence  of  simple  random  sets  Sn  such  that 

<sn)n=1  ( 2,  •*  s  •  Pr-almost  surely,  and  for  each  n  ,  nSnit  is 

Pr-integrable  and  J  iiSn  -  SlidP(u)  -*  0  ,  as  n  -»  +»  .  (For  a  trea- 

W€Q 

tise  on  integration  of  multivalued  mappings,  see  Debreu  [45].) 

We  proceed  now  to  point  out  Robbins'  formula  (Robbins,  [219], 
p.  72)  relating  the  computation  of  the  expected  volumn  of  a  random 
set  to  its  probability  coverage  functions. 

Let  S  be  a  random  set  taking  values  in  c  s  B  (the  Borel 

m 

o-algebra  of  Rm)  ,  C  a  o-algebra  on  c  ,  and  u  or  vol  the 

m  m 

Lebesgue  measure  on  Rra  .  If  the  measure  ( S )  is  a  random 

m 

variable,  then  under  suitable  conditions,  its  expectation  E(pm(S)) 
can  be  obtained  as  follows  (by  use  of  Fubini’s  iteration  theorem). 


Theorem  9  (Robbins  [219]). 
Define 


g  :  Rm  x  C  -* 

[0,1] 

by 

g(x,A)  = 

f1 

if  X 

e  A 

=  $,(X) 

L  0 

if  x 

«  A  A 

®  c -measurable , 

then 

m 


E(Pm(S))  =  J  Mx)dpm(x) 


X€Rm 

where  *(x)  =  Pr(x  e  S)  ,  xe  Rm  . 

By  considering  many-point  coverage  functions,  and  under  suita¬ 
ble  measurability  conditions,  higher  moments  of  /^m(S)  are  obtained 

as  follows: 

For  k  a  1  .  E(/v  (S)k)  = 
m 


j  *(xi . xk>  ®  1 . dv 

Rmk 

where  =  p 


m 


m 


for  i  =  1,2 . k  ,  and 


i  =  l 


derotes  the 
m 


product  measure  on  R 


mk 


( 
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Remark . 

In  the  case  k  =  1,  the  measurability  of  g(x,A)  implies  that 
x(x)  is  measurable  and  that  A<m(S)  ls  a  random  variable.  One  can 

view  (g(x,-)  |  x  «  Rm)  as  a  0-1  stochastic  process  on  the  mea¬ 
surable  space  (c  ,c )  ,  so  that  the  measurability  condition  of  g  is 
equivalent  to  the  fact  that  this  stochastic  process  is  a  measurable 
process.  The  following  is  a  concrete  situation  where  Robbins'  form¬ 
ula  is  valid:  c  =  sc  ,  e  =  a{ac)  ,  i.e.,  S  is  a  compact-valued  random 

set  on  Rm  .  Indeed,  the  stochastic  process  (g(x,-)  |  x  e  Rm)  on 
(jc,a(jc))  is  measurable.  That  can  be  shown  as  follows:  It  is 
enough  to  show  that 

g-1 ( { 1 > )  «  ®ra  «  *(*)• 

Now,  viewing  X  as  a  set  whose  "points"  are  compact  sets,  and 
defining  a  multi-valued  mapping 

r  :  (*,»(*))  -  »{Rm) 

by 

r(  K)  =  K  , 

i.e.,  r  :  (JC,a(*))  -♦  (*,»(*))  ,  but  the  second  space  is  considered 

as  the  collection  of  all  possible  values  of  the  multi-valued  map¬ 
ping  in  Rm  .  It  is  obvious  that  r  is  a(*)  -  a (X) -measurable. 
Next,  observe  that  g-1({l))  =  G(r)  ,  the  graph  of  r  ,  defined  as: 

G  (r)  =  {  (x,K)  |  x  «  r  (K)  >  S  Rm  x  3E  . 

Since  Rm  is  metric  and  separable,  and  r  is  »(*)  -  a(*)- 

measurable,  it  follows  that  G(r)  e  a  a(X)  ,  Q.E.D. 

This  last  statement  is  a  special  case  of  the  following  theorem. 

Theorem  10  (Debreu  [45],  p.  360) 

Let  X  be  a  metric  space,  3"  its  Borel  o-algebra  and  r  a 
multi-valued  mapping  from  an  arbitrary  measurable  space  (H,x)  to 
»(X)  ,  with  values  in  *  (of  X)  .  If  X  is  separable  and  r  is 

x  -  a(JC)  measurable,  then  the  graph  of  r  is  in  the  product 
a-algebra  X  ®  <r  . 


Remark ■ 

If  f  is  a  single-valued  mapping  from  V  to  W  (Hausdorff 
topological  spaces),  then  the  graph  of  f  ,  defined  as 
{(v,w)  j  w  =  f(v)}  s  V  x  W  ,  is  closed  in  V  x  W  (with  product 
topology)  if  f  is  continuous.  But  the  Borel  o-algebra  of  V  x  W 
contains  the  product  o-algebra  o(V)  ®  o(W)  ,  with  equality  if,  in 
addition,  both  V  and  W  have  countable  bases  for  their  topology 
(see,  e.g.,  CourrSge  [42]). 
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More  generally,  for  random  closed  sets,  we  have: 


Theorem  11  (Matheron  [171],  P-  47) 

Let  X  be  a  locally  compact,  Hausdorff  and  separable  space 
(with  topology  tr  ,  and  Borel  o-algebra  denoted  by  o  ( a" ) )  .  Then 

any  random  closed  set  S  is  measurable  in  the  following  sense:  The 
mapping  g  :  X  x  9  -♦  (0,1)  is  a  (sr )  ®  a(s )-measurable . 

Indeed,  let  a  be  a  countable  base  for  the  topology  9  .  Then 
x  <  F  (for  F  «  9)  is  equivalent  to:  there  exists  D  e  a>  such 
that  x  e  D  and  F  n  D  =  0  ,  Thus , 

g_1({0})  =  U  (D  x  ?D)  e  o{3-)  e  a(?)  , 

D«a 

where  =  (F  «  ?  |  F  n  D  =  0}  . 


Corollary. 

(Matheron)  With  the  conditions  of  the  above  theorem,  let  p/ 
be  a  positive  measure  on  (X,o(3))  .  Then  the  mapping 

(F  «  9)  -*  pj(F)  =  f  g(x,F)  dpi(x)  ,  from  ($,»(»))  into  IR+  ,  is  a 
xeX 

positive  random  variable  (i.e.,  pi(S)  IS  a  positive  random  var¬ 
iable)  ,  the  expectation  of  which  is 

E(^(S))  =  J  Pr (x  «  S)  d^(x)  . 
x«X 

Indeed,  by  the  previous  theorem,  g  is  o (y )  e  a (3 ) -measurable , 
hence  F  -♦  pi(F)  is  a  ( 3 ) -measurable .  Next,  by  Fubini's  theorem,  we 
have : 

E(a/(S))  =  J  pi  ( F )  dPg  ( F)  =  J  J  g  ( x ,  F )  dPg  ( F )  dpi  ( x ) 

F«*  X  9 

=  J  Pg(F€  9  I  X€  F)dpi(x) 

X€X 

=  J  Pr (w  |  x  «  S(w)  }dpi(x)  . 

X«X 

Some  applications  of  Robbins'  Theorem  to  the  problem  of  deter¬ 
mining  the  class  of  random  sets  having  given  coverage  functions  up 
to  some  order  will  be  treated  in  section  5.4.  This  is  also  inti¬ 
mately  included  with  the  problem  of  relating  dispersions  to  random 
sets . 


CHAPTER  5 


RANDOM  SETS  AND  DISPERSIONS 


In  this  chapter,  we  establish  connections  between  random  sets 
and  dispersions,  discuss  the  principle  of  maximum  entropy  for  ran¬ 
dom  sets,  and  relate  our  analysis  to  some  familiar  situations  in 
statistics . 


5 • 1  Zero-one  stochastic  processes. 

An  alternative  approach  to  the  development  of  random  subsets 
‘rom  primitive  ones  and  relations  between  random  sets  and  uncer¬ 
tainty  measures  may  be  expressed  through  zero-one  processes. 

A  zero-one  stochastic  process  V  =  (V.)  .  is  a  collection  of 

J  1€J0 

random  variables  :  n  -*  {0,1}  ,  j  e  JQ  which  are  consistently 

determined,  i.e.,  if  K*  £  K"  £  Jq,  then  the  joint  marginal  dis¬ 
tribution  of  (V^)j€K,  remains  the  same  relative  to  K’  or  K"  . 
We  can  identify  each  with  the  probability  space 

{ {0,1} ,*{0.1>,  Vj),  Pr ( V j  -  1)  =  fc/  j ( { 1 } )  =  a.  =  0) 

=  1  -  ,  j  «  JQ  .  Since  each  such  space  is  trivially  a  Polish 

space  (a  complete  separable  metric  space  -  see,  e.g.,  Neveu  [184*], 
pp.  64  et  passim;  p.  83)  then  there  exists  a  unique  probability 
measure  v  which  extends  all  of  the  yj's  to  two  levels  as  fol¬ 
lows  in  terms  of  probability  spaces: 


Baire-level  :  ({0,1}  ,  o {Z  ^(Jq) ) ,  v)  , 

J 

Borel-level  :  ({0,1}  °,  o(C^(Jq)),  u)  , 

noting  o(c^(J))  £  o (c £ ( JQ ) )  .  For  any  C  £  G  e  J ( Jq ) 


Pr (  &  (V,  = 


1)  &  &  (V  =  0)  ) 

jeGHC  J 


=  /i*1)(C)  =  Z  (-i)card  ^(CUK) 
K£GHC 


=  v  (0  )  , 
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where  p  :  }  (G)  -*  [0,1]  is  the  doubt  measure  determined  by 

p( B)  =  Pr(  &  (V .  =  1)  )  ;  B  £  G  , 
j«B  3 

where  here  y  =  (j)jgG  ,  a  =  (aj)jeG  ;  for  j  e  C  ,  =  1  ;  for 

jeG-iC,aj  =  0.  Further  properties  such  as  finite-regularity 

and  completeness  of  measure  (at  the  Borel  level  extension)  also 
hold,  completely  analogous  to  the  situation  for  9(JQ)  through  the 

^-mapping . 

In  particular,  letting  JQ=  X  : 

Theorem  1 

Given  any  base  space  X  ,  {^,X}  £  £  *(X)  ,  and  any  zero-one 

stochastic  process  V  =  (V,)„v  indexed  by  X  ,  there  is  a  unique 
corresponding  random  subset  SQ  of  X  with  range  correspond¬ 

ing  to  probability  space  (aQ  ,  o( c1(aQ))»  us  )  induced  by  the 

o 

always  measurable  mapping  <J>  1  ,  i.e.,  v =  v°<$  ,  and 

ao 

o(C1(aQ))  =  <t>_1(o(c  •  (X)  ) )  n  ao  ,  etc. 

Remark. 

The  mapping  { <p  |  sq)  may  be  thought  of  as  a  random  ordinary 

membership  Function  corresponding  to  probability  space 
J 

({0,1}  ,  o(Cj(X)),  u)  .  The  above  result  also  shows  that  a  zero- 

one  process  is  uniquely  specified  by  all  of  its  joint  one-values. 

Theorem  2 

(i)  If  p  is  a  doubt  measure  over  J(JQ)  •  then  p  determines  a 

unique  0-1  stochastic  process  V  =  (VJ,{j  such  that 

3  3  o 

Pr(  &  (V.  =  1))  =p(C)  ,  V  C  €  1  (J  )  . 

j€C  3 

Indeed,  it  follows  that  for  any  C  £  G  e  J (JQ)  , 

Pr  (  &  (V  =  1)  &  &  (V  =  0))  =  A'l 1  ’  (  { C )  )  , 

jeC  3  JeGHC3 


where  p *  1 ^ 

tjr 


is  defined  as  before  in  Chapter  4. 
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(ii)  Conversely,  if  V  =  (V.)  -6  _  is  any  0-1  stochastic  pro- 

J  J  J  o 

cess,  then  V  is  uniquely  determined  by  the  specifications 

P(C)  =  Pr (  &  (V  =  1 ) )  :  C  e  J (J  )  , 
jeC  3  ° 

where  it  can  be  shown  p  is  a  legitimate  doubt  measure  over  }(Jo) 
and  all  results  in  (i)  are  valid  for  (i  . 

Remark . 

Thus,  the  class  of  all  zero-one  processes  can  be  placed  in  a 
one-to-one  correspondence  with  the  class  of  all  doubt  measures. 


Corollary  1 


Let  <|>&  ,  <for  be  any  semi-distributive  DeMorgan  t-norm  and 

t-conorm  pair. 

Then  for  any  A  «  s(X)  ,  the  doubt  measure  ^A  and  plausi¬ 
bility  measure  p  =  1  -  p  we  have  the  following  properties: 

A  r  Or  o  n  f  Ct 

(i)  PA  &  determines  a  unique  0-1  stochastic  process 

~ A , &  =  ^ VA , & , j 1 J  SUCh  that  f°r  311  C  €  ,(Jo>  ’ 

o 

Pr(  «  ' 'VA.«.j  =  1)5  =  PA,&(C>  * 

J€C  X€  C 


with  eq.  (*)  of  the  Appendix,  section  4.2,  also  valid  for  p 
replaced  by  p  V.  by  V.  .  .  . 


A  ,  &  '  ’j 


A ,  &  ,  j  • 


(ii)  P, 


determines  the  0-1  stochastic  process 


~A ,  or  =  ^  VA ,  or  ,  j  ^  je  Jq~  (1  “  V  CA , & , j > je Jo  such  that  for  a11 


C  e  1 ( Jq)  , 


Pr (  Or (V 


A ,  or ,  j 


i)>  -  /Vor(C>  -  WV*” 


5.2  One-point  coverage  problei 


A.  Background. 


Now,  given  any  space  X  and  any  fuzzy  subset  A  of  X  ,  there 
always  exist  {in  general,  many,  unless  A  is  an  ordinary  set)  ran¬ 
dom  subsets  S  of  X  such  that  S  ,  under  its  one-point-coverage 
function  is  the  same  as  the  corresponding  possibility  function  of 
A  : 
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<f>A(x)  =  Pr(x  €  S)  ,  for  all  x  «  X  . 


We  write: 


A  »  S  . 

In  particular,  we  may  choose 

s  ■  SU(A)  i  C'0-1'  ■ 

a  nested  random  subset  of  X  one  point  coverage  equivalent  to  A  , 
where  U  is  a  random  variable  uniformly  distributed  over  [0,1]  . 

If  it  is  subsumed  (and  it  is  throughout  where  required)  that 
(n,a»,Pr)  is  some  non-atomic  probability  space,  then  such  a 
U  :  n  -  [0,1]  ,  may  always  be  chosen.  (See  Theorem  3.) 

Or  choose  S  =  T(A)  ,  where  T(A)  has  a  (random)  membership 
function 


PT(A) 


=  (. 


T(A) 


(x) 


xe  a 


which  may  be  considered  as  a  mutual 'v  statistically  independent 
zero-one  stochastic  process,  where 


Pr((j)T(A)  (x)  =  11  =  <f>A(x)  ’ 

Pr  ( 4*  t  (  a  )  (*)  *  0)  «  1  -  <J.A(X)  , 


for  all  x  e  x  .  (See,  e.g.,  Goodman  [90],  [77],  [78]  and  Nguyen 

[190]  .  ) 

Conversely,  it  is  clear  that  any  random  subset  S  of  X 
generates  a  uniquely  determined  one  point  coverage  equivalent  fuzzy 
set  A  =  A ( S )  by  defining 

$A(s)(x)  =  Pr<x  €  S)  ;  ail  x  €  X  . 


Thus  each  fuzzy  subset  of  a  space  may  be  represented  by  an  entire 
class  of  random  subsets  of  the  same  space  through  the  common  one 
point  coverage  function.  This  also  implies  that  a(X)  .  the  class 
of  all  random  subsets  of  X  ,  is  partitioned  into  subclasses  of 
random  subsets  of  X  ,  which  each  such  class  represents  the  equi¬ 
valence  class  of  all  random  subsets  of  X  ,  all  one  point  coverage 
equivalent  to  a  given  (and  arbitrary)  fuzzy  subset  of  X  . 

Thus,  we  are  led  to  the  definition: 

A  (random  set)  choice  function  S  is  an  injective  mapping, 

S  :  *(X)  -  a(X)  ;  S(A)  »  A  ;  all  A  e  ?(X)  , 

i  .  e  .  , 

<f>A(x)  =  Pr  ( x  e  S  ( A )  )  ;  all  x  e  X  . 

Hence,  Sy  and  T  are  examples  of  choice  functions. 
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More  specifically.  Sy(A)  ^as  a  nestec*  range  -  /*A([0,1])  , 

where 

r A(x)  3  ^([x.l])  ;  x  e  [0,1]  . 

Sy  corresponds  to  induced  probability  space 

'V‘o.in  ■  •  “a’1  ;  "a'  i  v°VrA1  ■ 

where  b1  is  the  real  Borel  field  over  [0,1]  .  In  addition,  for 

all  C  e  »(X)  , 

CC  =  rA[0.inf  <j>A(x)]  ;  3C  =  rA[sup  $A(x) ,  1]  ; 

X€  C  C 

CC  =  rA[0'  SUp  $A(X) ]  ' 

X€  C 

with  evaluations 

Pr(C  £  S  (A))  =  inf  <p  (x)  ;  Pr(*  *  Sy(A)  S  C)  =  1-sup  «t>  ( x )  ; 

xe  c  xe  C 

Pr(S.(A)  A  C)  =  sup  <J>.(x)  ,  where  A  is  the  incidence  rela- 

xeC  A 

tion  ( Chapter  3 ) . 

Sy  also  has  the  maximal  coverage  function  among  all  choice 
functions  S  : 

Pr  ( C  £  SyfA))  i  Pr  ( C  £  S  ( A )  )  ;  for  all  A  e  ?{X)  ,  C  €  *(x)  . 

In  addition,  Sy  as  a  mapping  from  the  class  of  all  fuzzy  sub¬ 
sets  of  any  given  space  onto  the  class  of  all  nested  random  subsets 
of  the  same  space  is  an  isomorphism  -  up  to  the  one  point  coverage 
equivalence  »  -  between  the  following  Zadeh’s  fuzzy  set  operations: 
$ Q  =  (1  -  (•  )  ,  min,  max)  is  the  choice  of  the  fuzzy  set  system  - 

and  corresponding  ordinary  set  operations:  complements,  cartesian 

products,  intersections,  cartesian  sums^unions,  projections,  subset 
relations,  (defined  here  by  ordering  the  membership  functions  point- 
wise  ),  functional  and  inverse  functional  transforms.  (See,  e.g., 
Goodman  [88].)  For  example,  for  any  f  :  X  -»  Y  ,  and  any  fuzzy 
subset  A  of  X  , 

f ( A)  »  Sy(f(A))  »  f(Sy(A))  , 

i . e . ,  for  any  x  «  X  , 

4>f(A)(X)  =  Pr(X  €  Sy(f(A)))  =  Pr  (  X  €  f(Sy(A)))  . 

Indeed,  for  Sy  ,  there  are  even  stronger  relationships: 
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Sy ( A  U  B)  =  SyiA)  U  Sy(B) 
Sy(A  n  B)  =  Sy{A)  n  Sy(B) 


Sy (proj (A) )  =  proj(Sy(A)) 

Sy(f (A) )  =  f ( Sy { A) )  . 

but  note  that  although 


Sy(X  H  A)  *  X  ^  Sy(A)  , 

Sy(X  HA)  =  X  A  S1_y(A)  (+) 

for  all  A,B  €  5 (X)  . 

Eq.  (+)  serves  as  a  check  on  the  prevention  of  the  violation  of  the 
Law  of  Excluded  Middle  for  ordinary  sets:  For  any  x  m  X  , 

0  =  Pr  ( x  «  0)  =  Pr  ( X  e  Sy(A)  n  (X  -I  Sy(A))) 

i  Pr ( X  €  Sy(A)  n  Sy(X  h  A)) 

=  Pr(x  «  Sy(A)  n  (X  h  S1_U(A))) 

^An (XHA) 

=  min(<f>A(x)  .  <Px^A(x)  ) 

=  min(<J>A(x)  ,  1  -  <f>A(x)  )  . 


The  range  for  T  ( A )  is  ®Q  =  *{x  |  $A(x)  >  0} 

Pr ( C  s  T ( A )  )  =  Ji  <J>.(x)  ;  Fr(0  *  T ( A )  s  C)  =  ir  (1  - 
x«  C  xe  c 

Pr  ( T(  A)  A  C)  =  1  -  n  (1  -  <p  (x)  )  . 

xec 


and 

<t>A(x)  ) 


For  fuzzy  set  system  5^  =  (1  -  (•).  prod,  probsum)  ,  T  in¬ 
duces  isomorphic-like  relations  for  complements,  cartesian  products, 
intersections,  cartesian  sums,  unions,  projections,  etc.  (See  [90] 
and  [ 8  8  ]  .  ) 

See  also  the  following  papers  for  related  results  in  estab¬ 
lishing  one  point  coverage  representations  of  fuzzy  sets  by  random 
sets:  Orlov  [197],  Wang  [261],  Wang  and  Sanchez  [262],  Nguyen 
[187],  [190],  and  Hohle  [116],  [117],  Also,  Wang  and  Sanchez  (e.g., 
[264],  [265]),  extend  earlier  work  in  relating  fuzzy  sets  to  random 
sets  by  introducing  the  general  notion  of  "hyporf ields"  to  motivate 
random  set  definitions  and  properties,  and  as  well,  the  concept  of 
random  fuzzy  sets. 


B .  Flou  c lasses,  level  sets  and  nested  randoa  sets. 

With  a  random  variable  interpretation  for  indices,  Zadeh's 
early  result  [278],  as  well  as,  Negoita  and  Ralescu's  extensions 
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[184]  of  Gentilhomme ' s  flou  sets  [69]  (essentially  a  collection  of 
nonrandomized  nested  sets)  may  be  tied  in  with  one  point  random  set 
coverage  representations  of  fuzzy  sets  via  choice  function  Sy 

The  choice  function  Sy  is  definable  through  the  mappings 

r  ,  A  e  5(X)  .  The  ordinary  set  i  r  .(a)  is  often  called  in  the 

A  A 

fuzzy  set  literature  the  a-level  or  a-cut  set  formed  rrom  A  , 
a<=  [0,1]  .  As  early  as  1971  ([278]),  r  was  employed  in  repre¬ 

sentations  of  fuzzy  sets  in  terms  of  ordinary  ones,  via  the  equation 


<t>  .(x)  =  sup  (a-A  .  .  (X)  )  ;  all  X  <s  X  ,  A  €  7(X)  ,  (i) 

A  0<aS 1  rA[  1 


later  extended  by  Sugeno  [248*]  for  defining  fuzzy  integrals. 

(Note  that  integrating  both  sides  of  (*)  with  respect  to  x  leads 
to  the  standard  simple  function  representation  for  a  Aebesgue 
integral.)  a-cuts  may  also  be  used  to  motivate  techniques  for  the 
combination  of  evidence. 

But  notice  that  for  any  outcome  U 


X  €  Sy(A) 

iff 

fA(x)  a  U 

iff 

x  «  rA( o) 

iff 

♦rA(0)(x) 

X  «  Sy(A) 

iff 

A ( u ) (x) 

1  ; 

0  , 


whence 

E<*rA(U)(x)) 

r1 

j  «-$r  da  =  sup  (a.*  ,  ,(x))  , 

a=0  A '  '  0Sa< 1  a'  ' 

the  same  as  ( t  )  . 

Originally,  in  1968,  Gentilhomme  [69]  independent  of  Zadeh , 
envisioned  the  extension  of  an  ordinary  set  by  the  addition  of  a 
zone  where  set  membership  would  not  be  certain.  Negoita  and  Ralescu 
[184]  extended  this  concept,  called  riou  set,  to  that  of  an  arbi¬ 
trary  collection  (A  ) _ .  ,,  of  nested  sets,  sufficiently  continuous 
1  a'OSaZl  1 

with  respect  to  the  index  a  .  A  typical  flou  set  as  defined  here 
will  be  C  =  (Ca)a€[0jlJ  ,  where  C  e  9>(X)  , 

X  =  CQ  2  ••  2Ca2C^2  ••  2CX  :  l>/3>a>0  , 

with  continuity  condition 

Cv(K.)  *  n  Cq  ;  all  K  S  [0,1]  , 

ae  K 


Pr(x  €  Sy(A) )  = 


v  being  the  sup 


operat ion . 
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Operations  C  ,  U  ,  n  and  subset  ordering  £  may  be  defined 
over  ^MX)  ,  the  class  of  all  flou  subsets  of  X  ,  by  requiring 
these  operators  and  relations  to  hold  component-wise  for  the  flou 
sets  involved  (in  the  proper  order).  Ralescu  ana  Negoita  [164] 
first  showed  (replacing  the  unit  interval  range  of  fuzzy  set  member¬ 
ship  functions  by  a  lattice  L  with  sufficient  ordering  properties) 
that 

r  :  J(X)  -  9tlX)  ;  r  ( A)  =  =  * r  A(a  )  >ae  [  0 , 1  ]  :  A  €  *(x)  '  (#) 

is  a  bijective  mapping,  which  is  in  fact  an  isomorphism  preserving 
C  .  u  ,  n  ,  and  £  ,  where  the  fuzzy  set  counterparts  correspond  to 
fuzzy  set  system  ,  with  £  defined  here  by,  for  any 

A,B  e  7 (X)  , 

A  £  B  iff  <J>.(x)  5  $  (x)  :  all  x  e  X  . 

In  addition,  (analogous  to  the  isomorphism-like  properties  of 

Sy )  also  preserves  functional  and  inverse  functional  transforms. 

Extensions  of  these  relations  for  other  fuzzy  concepts,  including 
fuzzy  groups  and  fuzzy  dynamic  systems,  have  also  been  accomplished 
[184].  In  a  related  area,  P.adecki  [209]  independently  obtained 
similar  isomorphic  relations  for  level  set  mappings.  Earlier, 
Goodman  (1976),  referenced  in  Nguyen  [190]),  in  effect,  replaced  the 
non-random  parameter  a  in  eq.  (#)  by  the  random  variable  U  , 
uniformly  distributed  over  [0,1]  .  This  has  the  effect  of  re¬ 
placing  each  flou  set  in  eq .  (#)  by  a  corresponding  nested  random 

set,  namely  rA(U)  ~  •  In  addition,  there  is  an  obvious 

bijective  correspondence  between  ?£{X)  and  -KOt(X)  ,  the  class  of 
all  nested  random  subsets  of  X  .  Before  detailing  these  results, 
it  is  of  some  importance  to  establish  the  following  necessary  and 
sufficient  conditions  when  a  uniformly  distributed  random  variable 
may  be  used  in  terms  of  non-atomic  prbability  spaces  and  other 
criteria.  (See  [89].) 


Theorem  3 


Let  P  =  (ft, 0 i,Pr)  be  a  probability  space.  Define  a  flou  class 
A  =  (AQ)a€  [Q  j  £  3)  to  be  a  nested  class  of  sets  which  are  non¬ 
increasing  inclusion-wise  with  respect  to  index  a  and  which  are 
continuous  with  respect  to  union  and  intersection  relative  to  index 
a  .  Then  the  following  statements  are  equivalent: 

1.  There  is  a  random  variable  U  :  ft  -•  [0,1]  uniformly  dis¬ 
tributed  . 


2.  For  any  probability  distribution  function  F  over  ir  , 
there  is  a  random  variable  V  :  ft  -•  *  such  that  V  has  probability 
distribution  function  F  . 

3.  There  is  a  flou  subclass  A  =  (A  )  r  „  of  to  such  that 

a  ae [0,1] 

for  all  a  e  [0,1]  , 
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Pr ( A  )  =  1  -  a  . 
a 

4.  T  is  a  non-atomic  probability  space. 


Proof . 

1  implies  2:  Let 

V  =  FT (U)  , 

where  pseudoinverse  FT  is  defined  by 

Fl (t)  =  inf  F-1(inf{x  j  t  5  x  e  range(F)}) 
whence  for  all  t€  [0,1]  ,  xe  R  , 

Ff  (  t )  5  x  iff  t  S  Ft'-) 

1  implies  3:  Let 

Aa  =  U_1 ( [a , 1 ] )  ,  for  all  a  e  [0,1]  . 

3  implies  1:  For  all  u  e  n  ,  define 

U(w)  =  sup  {a  |  a  e  [0,1]  &  u  €  A^}  , 

which  implies  for  all  a  e  [0,1] 

Pr(U-1( [0,a] ) )  =  Pr(fl  H  Aa)  =  a  . 


4  implies  3:  Use,  e.g.,  the  standard  result  [104’],  pp.  168,  174. 

3  implies  4:  Suppose  there  is  an  atom  B  e  a  .  Then  use  continu¬ 
ity  of  probability  to  obtain  the  desired  contradiction. 

Thus,  from  now  on,  P  is  assumed  to  be  some  fixed  non-atomic 
probability  space,  which  thus  may  also  be  used  to  generate 
»(X)  =  [S  |  S  :  (1  -*  »(X)  is  measurable}  which  will  include  all 

random  sets  of  interest  such  as  Sy(A)  ,  T(A)  ,  etc. 

Let  A  e  9(X)  be  arbitrary.  Consider  now  the  mapping  S 
without  regard  to  the  random  variable  U  .  That  is,  define  the 
family  of  all  level  sets  associated  with  A  as  in  eg.  (#): 


r{  A)  =  (<*»/(  [a,  1])) 


ae  [0,1] 


and  Lev(X)  as  the  collection  of  all  level  families  r(A)  for  all 
A  e  ‘J(X)  .  Operations  between  level  families  are  defined  component¬ 
wise  in  terms  of  ordinary  set  operations  at  each  level  a  ,  including 
complements,  intersections,  union,  subset  relations,  functional 
transforms,  projections,  etc. 


368 


Goodman  and  Nguyen 


Theorem  4 

Let  X  be  any  fixed  space  and  defined 
tion  of  all  flou  classes  of  X 


~  ^o^aefO,!] 


9£(X)  as  the  collec- 
s  o>  (  X )  with 


AQ  =  X  .  Then 

1.  ‘Jt(X)  =  Lev(X)  ,  where  for  any  A  €  IX) 

A  =  r(BA)  ,  Ba  e  *(X)  . 

where,  letting  U  be  a  random  variable  distributed  uniform  [0,1] 


(X)  =  sup  {a  |  a  €  [0,1]  &  x  €  A  } 

A 

=  fi(x  t  ;  all  x  e  X  . 

Conversely,  if  A  e  S(X)  ,  then  r( A)  e  5f(X)  . 


2.  Let  the  collection  of  all  nested  random  subsets  of  X  be  de¬ 
noted  by  xa(X)  ,  where  it  is  assumed  that  for  any  S  e  xoi(X)  , 
range(S)  €  ?r(X)  .  Then  s(X)  ,  ?£(X)  ,  and  x»(X)  are  all  bijec- 
tively  related  as  in  the  following  diagram: 


Isomorphisms  or  isomorphic-like  relations  hold  for  all  of  the  above- 
mentioned  operations  defined  over  each  of  the  three  spaces. 

Proof . 

The  only  thing  new  to  show  is  that  is  surjective.  Let  S 

be  any  nested  random  subset  of  X  with  the  property 

range(S)  =  (A  )  „ „  .  ,  €  7£(X)  .  Thus,  there  is  a  random  variable 

a  '  a 'ae [ 0 , 1  ] 

V  :  n  -  [0,1]  such  that  for  all  wen,  S(u)  =  Ay(u)  .  Ey  part  1 

of  this  theorem. 


S(w)  =  4>  1  (  [  V  ( w  )  ,  1  ]  )  :  all  wen. 

range ( S ) 

Then  letting  F  be  the  probability  distribution  function  for  V 
and  using  the  proof  of  Theorem  3,  for  any  x  «  X  , 


x  €  S 


iff 

iff 


V  s 

u  s 


m  (x)  iff  FT  (U)  s  <|>b 

“range { S )  range(S) 

F  ( (x)  )  iff  xeSyfC), 
range ( S ) 


(x) 
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where 

<t>r(x)  =  F($  (x))  ;  all  x  e  X  . 

range(S) 

Hence,  S  =  (in  distribution). 

An  immediate  consequence  of  Theorem  4  is  that  for  any 
A  e  5 (X)  ,  there  is  a  unique  nested  random  subset  of  X,  S(A)  «  A  , 

namely,  S ( A )  =  S^fA)  .  (See  again  [89]  for  further  details.) 


C .  Additional  results  for  the  one  point  coverage  problem. 

Case  I . 

Let  X  be  a  finite  space  and  A  e  ?(X)  with  X  -♦  [0,1] 

given . 

Let  S  €  a(X)  be  arbitrary,  i.e.,  S  :  (i  -*  o«(X)  with 

S  *(c  )  €  *  for  all  x  e  X  ,  where  (n,*,Pr)  is  a  given  proba- 

\  X  / 

bility  space.  Note  that  S  is  bijectively  related  to 

<?>S  =  Jx€X  *  a  zero-one  stochistic  process,  where  for  any 

C  e  *(X)  , 

Pr(S  =  C)  =  Pr(<})s  =  $c) 


=  Pr  (  &  (<fo(x)  =  $r(x)  )  ) 
xex  S  C 

=  Pr (  &  (<|>_ (x)  =!)&(&  (<t>- (x)  =  0))) 


X€  X“i  C 


=  2  (-l)Card  * 


KSX-tC 


:ps(c  o  K)  , 


for  any  B  €  v ( X )  , 

PS(B)  3  Pr  (  &  (<(>  (x)  =  1))  =  Pr  ( B  £  S)  . 

X€B 

Now  by  Theorem  4  of  Chapter  3,  since  Pr(S  =  •  )  is  a  proba¬ 
bility  function  over  ?(X)  ,  it  follows  that  is  a  doubt  measure 

over  *(X)  iich  uniquely  determines  S  .  Note  the  convention 
^s(0)  =  1  • 

In  particular,  if  S  is  constrained  so  that  S  *  A  ,  i.e., 

they  are  one  point  converage  equivalent,  this  is  equivalent  to  re¬ 
quiring  each  x  e  X 


Pr($s(x)  =  1  )  =  <|>A(x) 


yielding 
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Pr(<}>s(x)  =  0)  =  1  -  $A(x)  < 

which  means,  for  all  x  e  X 

Ps((x})  =  Pr  ( x  e  S)  =  Pr  (4>s  ( x)  =  1)  =  $A(x)  . 

Now  let,  for  each  x  e  X  ,  U  be  such  that  w.l.o.g. 

x  a 

Ux  :  n  -*  [0,1]  is  a  uniformly  distributed  random  variable,  and 
U  =  is  a  stochastic  process  (with  all  well-defined  and 

consistent  joint  c.d.f.'s).  Identify  the  events 

<t>s(x)  =  1  iff  U„  s  $A(x) 

<t*s  ( x)  =  0  iff  Ux  >  <fA(x) 

for  all  x  e  X  .  Thus,  here 
S  =  A  . 

Thus,  for  any  B  e  9(X)  , 


PS(B)  =  Pr (  &  (U  s  <|»  (x)  ) ) 
X€  B 


and  thus  fj g  and  hence  S  are  specified  by  U  . 

In  particular,  choose  ,  any  n-copula.  This  uniquely  deter¬ 

mines  U  a  corresponding  well-defined  process,  where  for  any 


B  e  v(X)  and  constants  (c 


X  «  B)  ,  c^  e  [0,1],  x  e  B  , 


pr(  a  <UX  *  cx))  =  (cx)  . 

X€  o 

X«B 

In  turn,  <]>&  yields  S&  ,  where  for  any  B  e  9>(X)  , 

fj  (B)  =  Pr  (  <S  (<p  (x)  =  1))  =  Pr  ( B  £  S,) 

&  X«B  bSc  & 

=  PV(  &  (U  5  <t»A(x)  ))=<(>,  («J>A(X)  ) 

XeB  xeB 

and  hence,  for  any  C  €  ?(X) 


Pr(S&  =  C) 


=  5 

KSX-4C 


(-l)Card  (C  0  K)  . 


For  =  min,  Ux  =  U  ,  all  x  €  X  and 


S&  =  SU(M  . 
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For  $  =  prod,  all  U  's  are  statistically  independent,  for 

x  e  X  and 

S&  -  T ( A )  . 

For  t?>&  =  maxsum  and  n  =  2  ,  letting  X  =  {x^.x^}  with 

$A(x  )  +  ^A  ^  X2  ^  S  1  '  i,e-'  is  3  deficient  or  ordinary  pro¬ 
bability  function,  depending  on  whether  +  ^A^x2^  <  1  °r 

(^(x.,)  +  ^A^x2^  =  1  '  respectively,  it  follows  that 

Pr(S&  =  {xlfx2))  =  a/s  ({x1(x2>)  =  0  , 

& 

Pr(S&  =  {xx})  =  ^s({x1>)  -  /is<<xlfx2>)  =  , 

Pr(S&  =  { x  2 }  )  =  ps({x2))  -  )JS(  (Xj  ,x2)  )  =  ^A(x2)  ' 

Pr(S&  =  0)  =  /Js({x1))  -  ps({x2>)  +  ^s({x1,x2))  + 

=  1  -  *A(Xj>  -  W  • 

Thus,  here  S&  coincides  essentially  with  the  random  variable 

determined  by  $  .  This  case  corresponds  to  U  =  1  -  U 
A 

More  generally,  if  Z  (J>_(x)  £  1  ,  again,  $  corresponding  to 

xeX  A 

a  deficient  or  ordinary  probability  function,  S  may  be  identified 
with  the  random  variable  V  ,  say,  determined  by  $>A  ,  i.e., 

S  =  {V)  . 

Equivalently,  for  any  C  <=  ?(X)  , 

'  0  ,  if  card  C  <t  2 

Pr ( S  =  C)  =  $A(x)  ,  if  C  =  (x) 

1-  z  (x)  ,  if  C  =  0, 

x«  X 

and  thus  ( $  ( x )  )  v  is  a  disjoint  0-1  process,  i.e.,  at  most 

b  X€  X 

only  one  $s(x)  may  take  one  in  value.  Of  course,  this  Implies 

S  a  A  . 

Case  II 

Let  Y  S  iRn  be  arbitrary  with  <p  :  Y  -*  [0,1]  given.  Then 

A 

for  any  finite  subset  X  of  Y  ,  construct  random  set 

Sv  :  n  -  ?(X)  or  equivalently,  zero-one  process  ( <$>  (x))  as  in 

X  b v  X€ X 

X 

► 
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Case  I.  Clearly,  the  entire  collection  ($s  ‘x^xeX  yields  all 

X  Xej(Y) 

joint  c.d.f.'s  being  consistent.  Thus  Kolmogorov's  Extension 

y 

Theorem  may  be  applied  to  yield  a  probability  space  {{0,1}  ,  By,y) 

such  that  all  finite  joint  c.d.f.'s  of  (<$>  (x))  v  are  marginal 

sx  xeX 

c.d.f.'s  relative  to  v  ,  and  thus  relabelling,  >  =  ($g(x))x€y 

a  well-defined  stochastic  process  with  joint  probability  measure  u 
and  hence  we  may  -take 

S  =  4>-1{r)  , 

i  .  e  .  , 


*s  =  r  • 

as  the  desired  random  set. 

In  particular,  any  choice  of  infinite-copula  cf) _  results  in  a 

well-defined  uniform  process  (U  )  v  ,  each  U  distributed  uni- 

form  [0,1]  ,  in  turn  determining  a  unique  zero-one  process 

(<f>g(x))  y  over  Y  and  equivalently  S  e  *(Y)  . 


D.  The  one  point  random  set  coverage  problem:  a  general 
solution  for  the  finite  case. 

A  basic  question  may  be  posed:  Given  A  e  5(X)  ,  what  other 

random  subsets  G  of  X  exist  besides  Sy(A)  and  T(A)  such  that 

S  =»  h  ?  Some  results  are  given  in  [88],  [89]  where  a  family  of  ran¬ 

dom  subsets  of  X  is  obtained,  which  includes  ( A }  and  T(A)  as 

members,  all  one  point  coverage  equivalent  to  a  given  A  .  However, 
this  does  not  exhaust  all  possible  such  random  sets.  The  problem  of 
determining  all  possible  random  sets  one  point  equivalent  to  a  given 
fuzzy  set  is  called  the  one  point  coverage  problem.  (The  problem  of 
determining  all  random  sets  m  point  coverage  equivalent  to  a  given 
fuzzy  set  i*=  treated  in  the  following  section.)  Let 

yj(A)  =  {S  |  S  €  *{X)  &  A  «  S)  . 

Suppose  form  now  on,  X  is  any  space  with  n  =  card(X)  >  3  and 
order  all  2n  sets  C  e  3>(X)  by  order  s  where  if 

card(C’)  <  card(C")  then  C'  Z  C"  and  if  card(C’)  =  card(C")  , 
determine  an  arbitrary  but  fixed  and  consistent  order  also.  The 
notation 


f"l 

=  m ! / ( ( m- j ) ! j ! ) 

,  j  -  0,1,, 

.  .  ,m  ; 

;  1  = 

r 

:  0  = 

O' 

u  J 

Li. 

.6, 

a  =  2n  -  n  -  1 
n 
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will  be  used,  and  denoting  vector  and  matrix  transpose:  by  suner- 
script  Tr,  define. 

*  ( n ) ^  =  (1^rnlTr'  a‘irmTr .  (n  -  1)-~1fn-|Tr>  (1  by  V  : 


Cx  =  (^C(X))C^(X)  (an  bY  1J  '  311  X  £  X 
card ( C  )£  2 


C(n)  =  ({x>«X  <an  by  n)  : 


7  1  *■'  (S)  =  ( Pr  (  S  =  C) 


7C6^(X) 
card ( C  )£  2 


lan  by  1)  ;  all  S  e  at ( X )  ; 


v[K)  =  (4>a(x>)x€X  (n  by  1)  ;  r(A)  =  max  (  Z  ( ( x )  )  -  1,  0) 

X€  X 

all  A  e  5 (X)  . 


Theorem  5 

For  any  finite  space  X  (with  notation  as  above)  and  any 
A  e  7 (X)  , 


1  . 

where 


^(A)  =  (S  |  S  e  «t(X)  &  S)  e  ai(A)} 


»1(A)  =  (W  |  w  e  ir  is  arbitrary  satisfying  eqs .  (l)  -  (3))  , 


MAI  s  «|n)Tr'W 

(i) 

2  '(„)Tr'w 

(2) 

o  s  w 

~  a 

(3) 

n 

is  a  closed  convex 

region  in 

i  +  n  +  an  =  2n  hyperplane  bounds  with  an  uncountable  infinity  of 
possible  elements  7^  ^(S)  in  it,  or  equivalently,  S  e  y^(A) 

3.  There  is  a  bijective  correspondence  between  ^(A)  and  a^A) 
where  for  any  given  W  €  tt^A)  ,  S  e  r 1  ( A )  is  uniquely  determined 
(in  distribution)  by 


7 '  ' (S)  =  W 

Pr(S  =  0)  =  l  -  z  («|>  (x))  +  k,  / 

X€  X  (  n  ' 


Pr(S  =  (x)  )  =  <$>  (x)  -  1. 


W  ;  all  X  €  X  . 
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Define  now  the  vertex  set  ‘v'j  (A)  of  a  (A)  as  the  set  of 

W  e  a  (A)  such  that  equality  holds  in  eqs .  (1)  -  (3)  for 
linearly  independent  columns  of  ( *  (  n )  '  *  ( n ,  '  Xa  ^  ’  There  can  be  at 


vertex  elements.  It  also  follows  that  for  all 


nj  ln+1 


A  e  ^ (X)  : 


Range(S0(A>)  =  {$A  (  [y^  ( A)  ,  l  ]  )  |  j  =  1 - r}  , 


where 


range((pA)  =  { y  1  (  A )  ,  .  .  ,  yr  (  A )  >  ;  0  *  y^A)  <---<Yr(A)  <  1 


Hence , 


Pr(Su(A)  =  C)  = 


yj  ( A)  -  y  (A)  if  c  =  4>a  ([yj(A).i]) 
for  J  =  1 , . . , r  ;  Y0(A)  =  0  , 

0  ,  for  C  otherwise  . 


v  (SyfA))  e  A)  ,  where 

X(n)Tf  •  ''(2)(Su(A))  =  X  (4»a(x))  2  r(  A)  , 

X€  X 

'(n)Tr  '  *  -l*>  • 

(b)  Pr  ( T  { A )  =  C)  =  n  (<J>  { x )  )  -  n  (1  -  <f>  (x))  ; 

xeC  xeXHC 

all  C  e  a (X)  . 

(  2  ) 

j  (T(A))  e  interior ( A) )  in  general,  where 

Tr  ■  --(2)(T(A))  =  Z  (<p.(x)  )  -  1  +  tt  (1  -  <p.(x))  >  r(A) 

xeX  X€X 


CvTr  ^(2)  (T(A)  )  =  <K(X)-(1  -  ff(l  -  <|>.  (y )  )  )  *  <K(x)  ;  all  x  e  X  . 

X  A  yex-l{x}  A  A 

(c)  It  should  be  noted  that  the  condition  given  in  eq .  (1)  is 

superfluous  iff 


Z  (<|>A(x))  S  1  , 


i.e.,  <f>A  is  either  an  ordinary  probability  function  (equality 
holding  in  (4)  or  a  deficient  probability  function  (inequality 
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holding  in  (4)  over  X  ,  in  which  case  random  set  S' (A)  e  *  (A)  , 

where  S' (A)  may  be  identified  with  a  random  variable  over  X 
having  $A  as  its  probability  function  (possibly  deficient).  Thus 

*  (2)  (S1  (A)  )  =  C>  , 
n 

and  hence  -r<2'(S’(A))  e  r^(A)  .  Also, 

Pr(S*  (A)  =  0)  =  1  -  z  <J>,(X)  , 

X€X 

Pr  ( S 1  (A)  =  (x))  =  $A(x)  ;  all  x  €  X  . 


E .  One  point  coverage  functions  and  random  intervals.  [89] 

An  important  class  of  random  sets  is  the  random  intervals.  In 
this  section,  the  one  point  coverage  problem  is  specialized  to  the 
case  where  base  space  X  =  R , A  e  7 ( R )  ,  and  y  ^ ( A )  is  replaced  by  a 

more  restrictive  class  y  (A,Q)  ,  the  class  of  all  S  e  Q  such  that 

S  <*  A  ,  for  various  classes  Q  £  *  ( R )  of  random  closed  intervals  of 
R  . 

Define  first  oi  as  the  class  of  all  random  closed  intervals 
and  o2  as  the  class  of  all  random  closed  intervals  S  with 
Pr(S  =  0)  =0  .  Then: 

(1)  There  is  a  natural  identification  between  Qj  and  the  class  of 

all  bivariate  random  variables  Z  =  over  the  upper  half  dia- 

2 

gonal  plane  in  R  ,  and  ’'-tween  0.  ^  and  the  class  of  all  random 
2 

variables  over  »t  .  From  now  on,  let  S  =  [V,W]  denote  a  random 
interval  with  V  and  W  r.v.'s  over  R  with  marginal  c.d.f.  for 
V  denoted  by  F^  ,  for  W  denoted  by  F2  ,  and  joint  c.d.f.  by  F  , 

etc.  The  convention  [a,b]  =  0  for  a  >  b  will  be  used. 

(2)  For  any  A  6  sF  ( R )  ,  S  =  [V,W]  e  ^(A.O  )  iff 

<t>A(x)  =  F1(x)  -  F(x,x)  ;  all  x  <=  R  , 

the  solution  which  for  F^  and  F  in  terms  of  $A  may  be  compli¬ 
cated  unless  is  further  specified. 

(3)  Letting  a  3  be  the  class  of  all  random  intervals  S  =  [V,W] 

with  V  and  W  statistically  independent. 
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Pr(S  *  a)  =  J  F1 (x) ■ dF2 (x)  , 
xeR 

and  for  any  A  e  ?  (  ik  )  ,  S  =  [V,W]  e  iff 

<t>A  (x)  =  Fj  (x)  ( 1  -  F2  ( x)  )  ;  ail  xe  r  .  (1) 

which  implies  that  log(<|>A)  is  of  bounded  variation  with 
1  im  <f> .  (  x )  =  0  . 

A 

X-»±® 

(4)  If  At  7(R)  is  such  that  <f>A  is  unimodal  (which  will  De  in 

the  sense  that  possibly  a  neighborhood  of  modal  points  exists  and 
continuity  from  the  right  holds)  at  some  xQ  say  at  which 

<t>,(x  )  =  1  ,  then  S  =  [V,W]  e  ?  (A  Q_)  where 

AO  1  o 

r  <J>  (x)  ,  if  X  <  xQ  fo  '  ,  if  x  <  xq 

FJ{X)  =  [  l  ,  if  x  £  xq  :  F2('  ]  =  [  1  -  4>a(x)  ,  if  x  >  xq 

(5)  Let  q4  be  the  class  of  all  random  intervals  S  =  [V,W]  with 


V  and 

w 

statistically 

independent  and 

identically  distributed. 

If  A  € 

•J  (R) 

,  then  f . ( A ,  Q 

)  *  a  iff  lim 

(<t>A(x)  )  =  0  and 

*A  15 

X->±' 

30 

unimodal  at 

some 

x  with 

o 

W  s  1/4  ' 

in  which  case 

VA'Q4 

)  = 

(  S  (  A  )  ) 

where 

S ( A)  =  [V.W] 

is  determined  by 

solving 

eg .  (  1  ) 

for 

F1  =  F2  '’,n  terms  $A  : 

[  (1  -  (1 

-  4-<Pa(x)  )  1/2  ) /2 

,  if  x  <  x 

o 

syx> 

=  f2( 

x)  =  | 

[  (1  +  (1 

-  4-<|>a(x)  )1/2)/2 

,  if  x  £  x 

o 

(6)  Let  Q_  be  the  class  of  all  nested  random  closed  intervals. 
Then  using  Theorem  2, 

°5  =  I  A  €  5  ( R )  &  <t’A  is  continuous  and  either:  uni¬ 

modal,  non-increasing,  or  non-decreasing  over  R)  , 

where  as  usual,  U  is  a  fixed  random  variable  uniformly  distributed 
over  [0,1]  . 

Also,  for  any  A  «  7  (  ir  )  with  continuous  and  either:  uni¬ 
modal,  non- increas ing ,  or  non-decreasing,  t^(A,Q^)  =  (Sy(A))  ,  where 

it  should  be  noted  that 
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VA) 


(-•*>,  sup  (4>a1(U)  )  ]  ,  if  $A 

[  inf  (t?”1  (u;  )  ,  +«]  ,  if  $A 


is  non-increasing 
is  ncn-decreasing 


(Examples  of  monotone  fuzzy  sets  include:  "tall",  "short", 

"oid" ,  etc . ) 

In  a  related  vein,  note  that  if  <fA  is  any  prob.  dist.  f  unc . 
over  IR  coresponding  to  random  variable  Z  ,  say,  then  Z  can  be 
identified  with  ’e  Theorem  3)  ,  Sy(A)  =  [  Z ,  +» )  and 

<K(x)  =  Pr(Z  5  x)  =  Pr(x  €  [  Z , +» )  )  ;  all  x  e  ir  . 

Two  more  interesting  classes  of  random  internals  remain  to  be  dis¬ 
cussed  . 


Theorem  6 


Let  oe  denote  the  class  of  all  fixed  length  random  closed 

D 

intervals.  Let  A  €  3(r)  .  Then  A,QC)  *  0  iff  there  exists 

1  b 

prob.  dist.  func.  Fft  and  positive  real  constant  b^  such  that  for 
all  x  €  X  , 

VX>  =  Fa(X  t  bA)  -  FA(X  -  V 

iff  t>A  is  integrable  over  »<  and  defining 


2-  bA  =  J  (  M  )  ■  dx 

X<=  IR 

and 

k 

Fa(x  +  (2k  +  1 )  -  bA )  =  Z  (<f>A(x  +  -  J  *  bA )  )  ;  all  x  e  [0,2bA] 

j  =  -® 

k  =  0  ,  ±  1  ,  i  2 . 


Fa  is  a  legitimate  prob.  dist.  func.,  in  which  case 
f  ( A , Q  _ )  =  { S ( A ) }  ,  where 

l  b 


3(A)  =  [V  -  b.  .  V  +  bj  , 


where  V  is  a  random  variable  having  prob.  dist.  func,  Fa 
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Theorem  7 

Let  denote  the  class  of  random  closed  intervals  of  the 

form  S  =  [V  -  Wj,  V  +  ]  ,  where  V  is  a  random  variable  over  R 
statistically  independent  of  random  variables  Wj  and  jointly 

defined  over  R  +  x  R  +  .  Then  for  any  A  e  ?(R)  , 

f 1 ( A )  *  0  iff 

*A  =  fA  *  GA 

where  f .  :  R  -»  [0,1]  is  unimodal  at  0  with  f  ( 0 )  =  1  , 

A  A 

lim  (f  (x))  =  0  ,  Ga  is  some  prob.  dist.  funct.  over  R  ,  and  * 
x-»  ± 

denotes  the  convolution  operator;  in  which  case, 

S(A)  =  [V  -  ,  V  +  W^]  e  y^(A,Q^)  where  V  has  prob.  dist.  f unc . 

G  ,  W  has  prob.  dist.  func.  1  -  I  )  over  R+  ,  W  has  prob. 
A  1  A  2 

djst.  func.  1  -  f A ( • )  over  R+  ,  with  jointly  arbitrary 

and  statistically  independent  of  V  . 

It  follows  that  any  A  e  5(R)  for  which  $A  is  uniformly 

continuous  and  integrable  over  R  may  be  arbitrarily  uniformly 
closely  approximated,  in  the  one  point  coverage  sense,  up  to  some 
scalar  multiple,  by  some  S  in  .  (Approximate  <p A  by 

c-  f  *  G  ,  where  c  =  c  /((2w)1^2-o)  ,  c  =  ("  $  (x)dx, 

O  O  J  A 

X€lR 

1/2 

f  =  ( 2n  )  ■ a ■  f  ,  fQ  being  the  probability  density  function  for 

2 

Gaussian  distribution  N(0,o  ).) 

The  construction  of  such  an  approximation  (given  in  [85]), 
f  *  G  to  >  thus  yields  the  approximation  desired  one-point- 

coverage  equivalent  closed  random  interval  S  e  Q ^ 

On  the  other  hand,  the  exact  solution  (if  it  exists)  of  the 
convolution  equation 

=  f  *  G 

for  some  f  and  G  as  above 
which  may  be  alleviated  to  some 
function  transform  ch(g)(S)  = 

the  above  equation  to 

ch($B)  =  ch ( f ) • ch ( G ) 

Also,  solutions  of  these  equations  may  be  related  to  unimodal  infi¬ 
nitely  divisible  distributions,  but  we  leave  this  for  future  woik. 
(See  also,  Lukacs  [161].) 


for  ,  can  entail  difficulties, 

degree  by  use  of  the  characteristic 

Ji  sx 

e  g(x)dx,  etc.,  which  converts 

xe  R 


Randoa  Sets  and  Dispersions 


379 


Lastly,  we  wish  to  indicate  connections  between  random  inter¬ 
vals  and  interval  mathematics  in  general  (see,  e.g.,  Nickel  [194]) 
could  be  explored  further  in  relation  to  dispersions. 


F .  A  connection  between  fuzzy  sets,  randoa  sets  and  random 
variables 


So  far  in  this  paper,  connections  have  been  established  between 
the  membership  functions  of  generalized  sets  and  the  one  point 
coverage  functions  of  random  sets.  Recently  [86],  [89]  it  has  been 
shown  that  random  variable  evaluation  functions  may  also  be  directly 
related  to  membership  functions  and  one  point  coverage  functions. 
This  result  is  restated. 

If  Y  is  any  space  of  elementary  events  and  a  s  »(Y)  is  any 
collection  of  compound  or  elementary  events  for  random  variable  V 
over  Y  (i.e.,  a  S  a  ,  the  o-algebra  over  Y  for  V)  ,  the  func¬ 
tion  gv  :  a  -►  [0,1]  where  for  any  B  «  a  ,  g(B)  =  Pr(V  €  B )  ,  is 

called  the  evaluation  function  for  V  over  event  collection  a  . 
Recall  the  notation  f^  as  the  one  point  coverage  function  for  any 

S  €  a(X)  for  any  space  X  . 


Theorei  8 


1.  Let  X  be  any  space  and  S  e  a(X)  .  Then  X  may  be  identi¬ 
fied  as  a  collection  of  events  a  for  some  random  variable  V  over 
say  Y  such  that 


and  whose  evaluation  function  over  a  is  the  same  as  fs  . 

2.  Let  V  be  any  random  variable  over,  say,  Y  with  a  any  col¬ 

lection  of  events  for  V  .  Then  letting  X  =  a  and  defining  S  by 
eq .  (1),  evaluation  function  gv  for  V  over  a  and  fg  both 

coincide . 

3.  Let  X  be  any  space  and  let  A  €  7{X)  be  arbitrary.  Let 

S  €  a(X)  be  arbitrary  with  S  e  f  ^ (A)  such  as  Sy(A),  T(A)  ,  for 

example.  Then  applying  part  1  above,  X  may  be  identified  as  a 
collection  of  events  a  for  some  random  variables  V  over  say,  Y 
such  that  eq.  (1)  holds  and  ,  f  ,  and  9V  coinc:ide-  That 

is,  any  fuzzy  set  is  one  point  equivalent  to  the  evaluation  function 
of  suitably  chosen  random  variables  over  event  collections. 


Proof  and  constructions  for  Parts  1  and  2: 

1:  First,  suppose  that  S  :  ft  -•  a(X)  is  a  random  set  relative  to 

initial  probability  space  (n,a,Pr)  and  induced  probability  space 

( range ( S ), $ , Pr° S  *)  ,  range(S)  £  ?(X)  . 


Now , 
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<3  ( s )  =  <c{x}(rng(S))  |  x  e  x} 

=  {c {x} (rng{S) )  |  x  e  X  ‘  }  £  B 


where  X'  £  X  is  chosen  so  that  the  mapping  r 
bijective,  where  for  any  x  €  X‘  , 


<3(S] 


is 


y>(x)  =  c  u}  (rng(S)  )  . 


Clearly , 


X(S)  =  { (Bf  |  Be  rng(S) } 

is  a  disjoint  exhaustive  partitioning  of  rng(S)  with  unique  repre¬ 
sentation 


c  . (rng(S))  =  u  (B> 

'  •  B€C(x)(rngS) 

for  each  x  e  X*  .  Thus  {S_1({B})  |  B  e  rng(S)}  is  a  disjoint 

exhaustive  partitioning  of  ft  . 

For  each  B  e  rng(S)  ,  let 

VB  =  S_1(B)  -  YB 

be  any  surjective  random  variable  for  non-empty  arbitrary  measure 
space  IYb.ji.)  such  that  all  Y  '  s  are  disjoint.  Then  define 

D  D  O 

Y  -  0  yb 

B«  rng ( S ) 

and  define  random  variable 

V  :  ft  -•  Y 

where  for  any  u  e  ft  ,  there  is  a  unique  B(w)  e  rng(S)  such  that 
u  e  S_1 (B(w ) ) 

and  hence 


V(w)  =  VB{w)(«) 


is  unambiguously  defined,  and  without  loss  of  generality  V  may  be 
assummed  measurable. 

Define  the  mapping  a  :  X '  *(Y)  by,  for  any  x  e  X‘  , 

x  =  v ( S  1 (c (x) (rng(S) ) ) } 

=  U  Y 


B€C (X>(rng(s) 1 


y  (Y) 
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Then  it  follows  that  a  is  injective  and  for  any  x  <s  X' 
V-1 (x)  =  S_1(C  (rng(S) ) ) 
which  is  equivalent,  for  all  w  e  n  , 

=  C  <V(u  )  )  (X)  ' 

and  which  implies,  for  all  x  e  X*  , 

Pr(V_1(x))  =  Pr(S_1{C(x} (rng(S) ) ) )  , 


g  (x)  =  Pr(V  €  x)  =  Pr(x  «  S)  =  f  (x)  ,-  all  x  €  X1  . 

Hence  (1)  holds  with  S  identified  with  S  and  X  =  »  . 

2:  Let  V  :  n  -»  Y  be  a  random  variable  for  measure  space  (Y,») 

Then  eq.  (1),  which  is  the  same  as 


<V(w)  ) 


all  u  €  a  , 


is  equivalent  to 


V  1  (x)  =  S  1  ( c  { x}  ( rng (  S )  )  ) 


and  hence 

gv(x)  =  fg(x)  ;  all  x  e  X'  . 

One  consequence  of  Theorem  8  is  a  new  interpretation  for  possi- 
bities  or  equivalently,  values  •  * or  x  e  X  ,  where  A  e  ?(X) 

is  given  through  membership  function  <pA  :  X  -•  [0,1]  : 

For  any  x  e  X  , 

Possibility  of  x  e  A  =  Possfx  e  A) 

=  Pr ( x  €  S ( A ) )  =  Pr (V(A)  €  x)  , 

and  since  the  events  x  e  X  can  also  be  considered  compound  or 

elementary  events  for  V(A)  over  Y  which  may  well  be  overlap¬ 

ping  and  perhaps  exhaustive,  possibilities  need  not  sum  to  unity 
when  X  is  discrete.  However,  when  -  and  only  when  -  <fA  is  an 

ordinary  or  deficient  probability  function,  possibilities  will  sum 
to  unity  or  to  less  that  unity,  possibilities  and  probabi 1 i ties 
coincide,  and  the  events  for  V(A)  are  all  necessarily  elementary 

and  disjoint,  with  V(A)  and  S ( A }  also  being  identifiable. 
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5.3  Multiple  point  coverage  problem  f891. 

A.  The  multiple  point  random  set  coverage  problem. 

For  any  integer  m  >  1  ,  define  as  the  collection  of 

all  non-vacuous  subsets  of  X  with  cardinality  <  m  .  Then  define 

‘J(m)(X)  -  <f  I  f  :  -  [0'^  & 

there  is  an  S  e  a(X)  such  that 

f (C)  =  Pr(C  s  S)  ;  all  C  e  *  (X) } 

(m) 

=  {f  I  f  :  »{m)(X)  -  C0.1])  . 

with,  in  general,  strict  subset  inclusion  holding  above.  That  is, 

there  are  functions  f  :  9  -»  [0,1]  which  are  not  the  m  point 

( in ) 

coverage  function  of  some  random  subset  of  X  .  A  simple  example  of 
this  is  generated  for  the  case  m  =  2  by  first  noting  the  basic 

constraints  for  any  random  subset  S  of  X  and  any 

x,y  €  X  : 

max ( Pr ( x  e  S)  +  Pr(y  e  S)  -  1,0)  <  Pr({x,y)  £  S)  (• ) 

S  min(Pr(x  €  S),  Pr(y  €  S ) ) 

and  then  choosing,  e.g.,  any  such  f  with  at  least  some  x,y  e  X 
such  that 

min  ( f ( {x} ) , f ( (y) ) )  <  f({x,y})  . 

This  situation  contrasts  sharply  with  the  case  m  =  l  where  indeed 
»(X)  =  »(1)(X)  =  <f  I  t  :  »(1)(X)  -  [0,1]}  , 

abusing  notation  somewhat  in  identifying  A  with  c|»  ,  for  any 

A  €  7 (X)  . 

In  the  following  discussion,  X  need  not  be  finite  nor  even 
discrete.  Clearly, 


*(1)<X>  2  ?(2)(X)  2  *(3)(X)  2  •••  2  *(~)(X)  =  ^  7(j)(X) 

and  if  for  any  f  e  ,(X)  , 

( m ) 

r  (f)  =  (s  |  S  €  a ( X)  &  f(C)  =  Pr ( C  £  S)  ;  all  C  e  a ,  ,(X)}  , 

lit)  ( m ) 


Vf)  =  y(l)<f>  2  '(2)(f>  2  •••  2  '<m)(f)  • 
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Letting  J(X)  be  the  class  of  all  finite  subsets  of  X  , 

5  (X)  =  (f  |  f  :  J  (X)  ->  [0,1]  & 

there  is  an  S  e  *(X)  such  that 
f(C)  =  Pr ( C  £  S)  ;  all  C  e  j(X)}  . 

Define  also  for  any  f  e  fF  j  ( X )  , 

(f)  =  (S  I  S  e  ot(X)  &  f(C)  =  Pr  ( C  £  S)  ;  all  C  €  }(X)}  . 

Then  gathering  all  of  these  definitions  together  yields  the  follow¬ 
ing  theorem: 


Theorem  8 . 

For  any  space  X  : 

1.  5(„)(X)  is  the  class  of  all  doubt  measures  over  5(X)  . 

2.  For  any  integer  m  >  1  ,  *(X)  is  disjointly  partitioned  as 


*{X)  =  U 


f€3F(m)(X) 


<y(m)(f)) 


3.  For  f  e  ,  there  is  a  unique  Sf  e  'ji(X)  such  that 

V)(f)  =  <Sf)  , 

with  the  converse,  for  each  S  e  »(X)  determining  the  coverage 
function  fg  by 


fs(C)  =  Pr ( C  £  S)  ;  all  C  e  }{X)  . 


4.  <  a(x)  <  and  r(X)  ,  the  class  of  all  zero-one  stoc¬ 
hastic  processes  over  X  ,  are  all  in  a  bijective  relationship  as  in 
the  diagram: 


*(X) 
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where  for  any  S  <=  tt(X)  ,  i(|>(S)  =  ($e(x))  =v  and  p  is  determined 
by  the  Kolmogorov  Extension  Theorem  applied  to  the  relations  in  5. 

5.  For  any  V  =  (V(x))  e  r(X)  and  any  C,D  «  J (X)  disjoint, 

A 

Pr((  &  ( V ( x )  =  1 ) )  &  (  fit  (V(x)  =  0)))  =  Z  ( (~i)card(K). f  (CUK) ) 

xeC  xeD  KSD 

where  f^  e  is  given  by,  for  any  B  e  J  (X)  as 

fv(B)  =  Pr {  fit  ( V( x )  =  1) )  . 

X€B 

Thus  in  terms  of  degree  of  coverages,  5(X)  and  »(X)  repre¬ 
sent  opposite  extremes. 

Returning  to  the  case  for  X  finite,  let  m  >  1  be  arbitrary 
fixed  with  card(X)  =  n  i  3  .  Define  for  any  S  e  »(X),  recalling 
the  total  order  defined  over  »(X)  , 

■  c”c^(X)  <V.  U  • 

card (CJim+l 


n ,  m 


n 

Z 

k=m+l 


ra>  > 


CBj(X)  =  {C  I  B  S  C  €  »(X)  &  card ( C )  =  j}  ; 


all  C€^(X);0SjSn. 


Also,  define  for  any  n  £  m  >  r,s  £  0  and  all  f  e  ?  .(X)  and 


C  e  t  (X)  ,  card ( C )  =  r  , 


(m) 


wr,0(f'C)  =  f(C)  :  for  s  =  0 


urg(f,C)  =  f(C) 


+  2 


(  (-D 


t=l 


Bee 


(f (B) ) )  ;  s  >  1  ; 


C,  t  +  r 


fi]  f  i.  iff  j  is  even 

l<Jj  l  S  iff  j  is  odd  . 


Theorei  9 


For  any  finite  space  X  ,  with  notation  as  above,  for  any 


f  €  »  ,  m  .  (  X  !  : 

( m } 


1  . 


*(m)(f)  =  {S  |  S  6  *(X)  &  ’r(m+1)(S)  €  *(m)(f)}  . 
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where 


*(m) ( f )  =  {W  |  W  «  R 


n,m 


is  arbitrary  satisfying  eqs .  (*), 


( *  * )  below} 

(- 


l)j-V  1  -  (  f  #  C )  [*].  z  f  [  t-m+J-1]  .  z  ( W  )]  (*); 

m-j,j  1<J3  1  -1  Bec  ( X )  B  J 

U  /  L 

all  C  e  ?{X)  with  card(C)  =  m  -  j,  for  j  =  0,1,2,. ...m  ; 

W  >  0  ;  all  B  e  ?(X)  with  card(B)  2:  r.  +  1  ;  (**) 

O 

all  f  e  *{m)(X)  , 


W  (WB}B€»(X) 


e  R 


n,m 


card(B)£m+l 


2.  * ,  .(f)  is  a  closed  convex  region  in  R  n,m  having,  in  gener- 

( m ) 

in  f  i  ^ 

al,  a  +  X  (  )  =  2  hyperplane  bounds,  and  thus,  in  general, 

n , m  L  t  J 

there  is  an  uncountable  infinity  of  possible  7^m+1*(S)  in  it  or 
equivalently  S  e  y  ^  (  f  )  • 

3.  There  is  a  bijective  correspondence  between  and 

»(m)(')  -  where  for  any  given  W  €  *^mj(f)  ,  S  e  y^jff)  is  uni~ 

quely  determined  by 


7  (m+1 ) 


(S) 


Pr ( S  =  C) 


W  , 

n 


u  (f,C)+(-l) 

®- J  <  J 


j  +  1 


2  f  ft-m+j-l 
t=m+l  j 


C,t 


for  all  C  «  ?(X)  with  card(C)  =  m  -  j  ,  j  =  0,1, ...,m  . 


Outline  of  proofs  for  Theoreas  4  and  9. 

First  obtain  Theorem  4  by  simply  extending  the  basic  identity 
<?>A(x)  =  Pr(x  e  S ( A)  )  in  terms  of  Pr(S  =  C)'s  together  with  the 

standard  probability  constraints  and  solve  in  terms  of  t '  ' (S) 

Theorem  9  is  obtained  by  tedious  induction.  The  key  computational 
identity  useful  in  the  proofs  is 
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2  (  1 

B«Cck(X)  D«Cb 

for  any  f  :  ?(X)  -* 
card(C)  =  i  . 


ir  ;  j  £  k  2:  i  ,  and 


•  2  (t (U)\  , 

D€CC.j{X) 

any  C  e  (X)  with 


Remark. 

In  Chapters  3  and  4,  explicit  relations  were  developed  between 
random  sets  and  uncertainty  measures  in  terms  of  subset  coverage 
functions  for  doubt  measures,  superset  coverage  functions  for  belief 
measures,  and  incidence  functions  for  plausibility  measures.  Thus, 
a  restricted  form  of  the  multiple  point  coverage  problem  (obviously 
including  the  one  point  coverage  problem)  where  multiple  point  cov¬ 
erages  are  given  in  terms  of  a  doubt  measure  of  sets  of  multiple 
points  is  solved  by  construction  of  a  random  set  as  given  in  Chap¬ 
ters  3  and  4.  In  the  multiple  point  coverage  problem)  discussed 
here,  no  restriction  in  general  (except  to  allow  some  random  set 
representation)  is  required  on  the  coverage  function. 

Recall  (see  Chapter  3.3)  that  Zadeh’s  possibility  measure  is  a 
special  case  of  a  semi-distributive  t-norm  possibility  measure, 
where  the  t-norm  is  min.  (See  Theorem  4,  Chapter  3,  where  these 
possibility  measures  are  related  to  random  set  coverage  and  inci¬ 
dence  functions.)  In  the  same  spirit  of  Theorems  2  and  3  (see  also 
the  following  remark)  of  Chapter  3,  a  relation  may  be  established 
between  Zadeh ' s  possibility  measure  and  random  sets,  by  direct  use 
of  random  set  theory  (a  la  Matheron) : 


Proposition  1 

Let  it  be  a  possibility  measure  on  X  ,  with  possibility  dis¬ 
tribution  function  f  .  Then  there  exists  a  random  set  S  on  X 
such  that: 

V  A  S  X  ,  71  (A)  =  sup  (71  (X)  |  I  S  A,  I  e  J(X)> 

with  ff(I)  =  Pr ( S  n  I  *  0)  . 


Proof : 

It  is  clear  that  the  restriction  of  J!  to  3  (X)  is  a  space 
law,  and  hence  by  Matheron' s  theorem  [171],  there  exists  a  unique 
probability  measure  Q  on  (?(X),M)  such  that: 

v  I  €  3  ( X )  ,  7i(l)  =  Q(A  |  An  1*0) 

Let  S  be  a  random  set  with  Q  as  probability  law,  i.e., 

77  ( I)  =  Pr(S  n  i  *  0)  ,  v  i  e  3  ( X )  . 


The  result  follows  from  the  fact  that 


Random  Sets  and  Dispersions 


387 


VA  S  X  ,  77(A)  =  sup  {77(1),  ISA,  I  e  J(X)}  . 

A  similar  result  for  the  topological  setting  is: 


Proposition  2 

Let  X  be  a  locally  compact  and  separable  space,  and 
f  :  X  -»  (0,1]  upper-semicontinuous  (u.s.c.).  Then  there  exists  a 
random  set  S  on  (3,o(3))  such  that: 

V  A  S  X  ,  77(A)  =  inf  (77(G)  |  A  £  G ,  G  e  <j  } 

where  77(A)  =  sup  (f(x)  |  x  e  A  }  ,  and 

77(G)  =  sup  { Pr  ( S  n  K  *  0)  |  K  e  X,  K  S  G)  . 


Proof : 

First,  let  us  show  that  there  exists  a  unique  probability  mea¬ 
sure  Q  an  (3,o(3))  such  that 

V  K  €  Jt,  77  { K )  =  Q(F  I  F  n  K  *  0}  . 

By  Choquet's  theorem,  it  suffices  to  verify  that  77  is  a  Choquet 
capacity,  alternating  of  infinite  order.  The  fact  that  77  is  al¬ 
ternating  of  infinite  order  is  obvious.  Now  let  Kn  €  *  «  n  -  1  > 

and  K  e  x  such  that  Kr  A  K  .  Let  a  =  inf  77  ( Kn )  and  p  =  77  ( K )  . 

It  is  obvious  that  p  z  a  .  Now  let  e  >  0  ,  and  set 
8  =  a  -  £  <  a  .  It  follows  that  Vn  £  1  ,  77{Kn)  >  0  -  i.e., 

sup  (f(x)  |  x  e  Kn)  >  a  ,  vn  2  1  . 

j 

Hence:  A  =  (x  e  x  I  f(x)  2  0)  n  K  *  0  and  since  by  construction, 

n  n  1 

v  21, A  SK  ex  (compact),  and  the  A  's  are  closed  (since  f 
n  n  n  l  n 

is  u.s.c.  by  hypothesis),  we  have 


d 

a  s  n  a  * 

,  n 
n=l 

0  . 

Now 

A  S  K  ;  we  have 

77(A)  S  77  (  K ) 

=  P  • 

But , 

by  construction,  of 

the 

An's  ,  77(A)  2  a  ; 

thus ,  a  s  p 

,  therefore 

a  =  p  . 

Finally,  since  X 

is  locally 

compact 

and 

separable,  any  Borel 

set 

A  is  capacitable, 

.  i.e.. 

77(A)  =  sup  { 77  ( K )  I  K€  X,  KS  A)  , 

in  particular,  for  A  e  G  .  Let  S  be  a  random  set  on  (3, 0(3)) 
having  Q  as  probability  law. 
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rr  ( K)  =  Pr  ( S  n  K  *  0)  ,  K  e  x  . 

The  approximation  of  ff(A)  ,  for  ASX,  from  n ( G ) 
G  2  A  ,  is  the  usual  extension  of  capacity. 


G  €  <3 


and 


B .  Multiple  point  coverage  functions  and  Bigg's  topos . 
Lemma 


Let  X  =  {0,1}  x  {0,1}  and  P  be  a  given  bivariate  proba¬ 
bility  function  over  X  .  Denote  the  x  -  and  y  -  marginal 
probabilities  of  P  ,  respectively,  by 

?1  =  P(l.l)  +  P(1.0) 

P2  =  P(l,l)  +  P(0,1)  . 

Then 

{q  |  q  is  a  probability  function  over  X  such  that 
the  x-  and  y-  marginal  probabilities  of  q 
are  the  same  as  that  for  P} 

=  {q€  |  -  min(P( 1,1),  P(0.0))  <  e  £  min{P(l,0),  P(0,l),  P ( 1 , 1 ) } 

where  for  any  such  «  , 

q€(i,j)  ^  P(i,j)  +  ( -1 ) J • « 

for  any  (  i  ,  j  )  e  X  . 

For  any  such  q€  ,  in  particular, 

q€(l,l)  =  P ( 1 , 1 )  +  €  • 


Application: 

Consider  a  random  subset  S  of  a  space  Y  and  its 
havior  at  two  fixed  subsets  of  Y  :  (x,y)  and  {x',y') 


local  be- 
Now  the 


local  behavior  of  S  at  any  point 
marginal  zero-one  random  variable 

random  characteristic  function  induced  by 


z  e  Y  may  be  identified  with  a 


v2  =  *s<z) 


,  where 
Thus , 


is  the 


VZ  =  1 


Hence,  we  may  define 


iff 

iff 


Z  e  S 
Z  «  S 
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Pr ( x , y  e  S)  =  Pr(Vx  =  1 ,  Vy  =  1) 
Pr ( x  e  S)  =  Pr(Vx  =  1) 

Pr(y  <  S)  =  Pr(Vy  =  0)  ,  etc. 

Thus  by  choosing  S  such  that 

Pr(Vx,  =  i.  Vy,  =  j)  =  q€(i,j)  , 

for  any  fixed  e  as  above,  in  general, 

Pr ( x  €  S)  =  Pr ( x '  e  s) 

Pr ( y  €  s)  =  Pr ( y 1  €  S)  , 

but 


Pr(x  e  S,  y  €  S)  =  Pr(x'  e  S,  y'  e  S)  +  € 
*  Pr(x‘  e  s,  y'  e  S)  , 


in  general . 

Hence,  in  general,  it  is  not  true 
3  f  :  [0,1]  x  [0,1]  -•  [0,1]  such  that  for  all  x,y  e  y  , 

(I)  Pr({x,y)  £  S)  =  f ( Pr ( x  e  S)  ,  Pr(y  e  S))  . 

On  the  other  hand,  for  certain  random  sets  this  is  not  only 
true  but  f  is  also  a  t-norm: 

Let  :  [0,1]  x  [0,1]  -*  [0,1]  be  any  semi-distributive 

t-norm.  Then  for  any  Y  (see  Goodman  [91]  or  Hohle  [116]  for 
further  details)  and  any  given  mapping  4>A  :  Y  -*  [0,1]  ,  there 

exists  a  random  subset  S.  r  of  Y  such  that  for  all 

Yl . ym  €  Y  ,  m  i  1  , 

(i)  Pr'(*l . V  S  SA;&> 

=  <t>£ ( Pr ( y i  ^  sA;&) .  Pr(y  *  S^)), 

(ii)  Pr(Yj  €  SA;a)  =  ^A(Yj)  >  J  =  1 . m  • 

In  particular; 

(  i  ii  )  For  =  min  , 

SA.S  '  VA>  3  ■ 


where  U  is  a  r.v.  uniformly  distributed  over  [0,1]  . 
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(iv)  For  $£  =  prod  , 

S.  f  =  T  , 

where  T  is  such  that  the  corresponding  random  membership  function 
<$>  is  equivalent  to  a  statistically  independent  0-1  process, 

i  .  e  .  , 


Pr{{y1 . ym}  £  T)  s  Pr(Yl  e  T)  -Pr(ym  e  T) 

=  VV'- W  ' 

Note  also,  for  any  random  subset  S  of  Y  ,  the  inequalities 
in  eq .  (  )  ,  section  5.3  (A),  are 

(*)  max ( 0 , Pr ( xe S )  5  Pr({x,y}£  S)  <  min(Pr(x  e  S),  Pr(yeS)) 

+(Pr(y«S)-l) 

the  same  bounds  formally  for  any  continuous  t-norm  <p  as  a  func¬ 
tion  of  Pr(x  e  S)  ,  Pr(y  €  S)  . 

Note  some  formal  similarities  in  eq.  (I)  with  Sklar'c 
Theorem . 

Now  consider  the  converse  problem: 

Let  Y  be  an  arbitrary  space  and  let  J'  'Y)  denote 
<  C  I  C  €  9 (Y)  &  card  C  S  2)‘  . 

(  2  ) 

Let  g  :  J  (Y)  -•  [0,1]  be  a  given  function.  Then  we  wish  to 
find  all  random  subsets  S  ,  if  any,  such  that  for  all 

C  «  J( 2 1 ( Y)  , 

g ( C )  =  Pr ( C  2  S)  . 

Equivalently,  let  g  :  Y  x  Y  [0,1]  be  symmetric  and  S  a 
random  subset  of  Y  such  that  for  all 
x ,  y  €  y  , 

(II)  g ( x, y )  =  Pr ( (x, y)  S  S)  . 

Letting  x  -  y  ,  this  includes  the  condition 

(III)  g(x,x)=Pr(x€S)  ;all  xeY. 

But  because  of  the  negation  of  relation  (I),  we  know  (II)  (and 
III)  ir.  general,  may  have  no  solution  for  an  S  given  a  g  : 

In  particular,  pick  any  g  :  Y  x  Y  -»  [0,1]  symmetric  such  that 
g  is  arbitrary  over  diag  ( 7  x  Y)  ,  and  such  that  there  is  at  least 
two  distinct  points  X0'Y0  **  ^  such  that 

I  *  9(Y0-Y0)  >  1  -  g(xo,XQ)  2  0  . 
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Then  define  g  otherwise  off  the  main  diagonal  to  be  arbitr¬ 
ary  symmetric  such  that  at  (x  ,yQ) 

0  ^  g(xQ,yo)  =  g(yQ,xo)  <  g(xo,xQ)  +  g(yo,yQ)  -  1  . 

Then  equation  (• )  will  be  violated,  no  matter  what  choice  of 

S  . 

Recall  that  any  random  variable  V  over  Y  may  be  considered 
a  random  set  S  =  {V}  .  Another  example  of  the  impossibility  in 

general,  of  II,  III  admitting  a  solution  S  for  a  given  g  is  ob¬ 
tained  by  choosing,  for  example,  Y  to  be  discrete 

gQ(x,y)  =  gQ(x,x) -5Y(0) (x,y)  ;  all  x.y  e  Y  , 

Gy*'0''  the  ciassical  Kronecker  delta  function^  and  such  that 

2  g(x,x)  >  1  :  0  <  gQ(x,x)  <  1  ,  all  x  €  Y  . 
xeY 

It  then  follows  that  if  3  S  satisfying  (II, III),  then  S  =  (V) 
for  some  r.v.  V  over  Y  .  But 

1  =  Pr ( V  e  Y)  =  X  Pr ( V  =  x)  =  X  g(x,x)  >  1 
xe  Y  xe  Y 

a  contradiction1 

Let  H  =  [0,1]  .  Let  X  «  SET  .  Recall  (see  section  2.4.2 

(E) )  g  €  0bx(Higg(H) )  iff 

(1)  g  :  X  x  X  -*  H  is  symmetric 

(  2  )  g  (  x ,  y )  a  g  ( y ,  z  )  s  g  ( x ,  z  ) 
for  all  x , y , z  e  X  . 

In  general,  given  any  g  satisfying  (••  )  ,  no  random  subset 

S  of  X  exists  such  that  g  and  S  satisfy  (II,  III):  In  parti¬ 
cular,  choose  g  =  gQ  .  It  is  clear  that  gQ  satisfies  (• ■ )  ,  but 

as  the  previous  example  has  shown,  no  S  satisfies  (II, III)  for 
given  gQ  .  On  the  other  hand,  there  are  g's  satisfying  (  •  •  ) 

which  admit  random  subsets  S  of  X  such  that  (II, III)  are  satis¬ 
fied.  For  example,  let  (X,y)  e  0bx(Fuz(H))  be  arbitrary.  Then 

le*  ,  for  all  x , y  €  X  , 

g  ( x ,  y )  =  y  ( x )  a  y  ( y )  . 

Then  the  previous  example  shows  S^(y)  =  y  1  [U,l]  ,  U  uniform  on 

[0,1]  will  satisfy  (II, III)  for  g  here. 

More  generally,  we  have  the  following  positive  result 
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Lemma 

Given  any  space  X  and  any  random  subset  S  of  X  ,  3  a 

t-norm  (continuous)  <p&  such  that 

<|>&(g(x,y).  g  ( y ,  z )  )  <  g(x,z)  ,  (•••) 

where 

g(x,y)  =  Pr ( ( x . y }  £  S)  , 
for  all  x,y,z  e  X  . 


Proof : 

Since  the  minimal  (continuous)  t-norm  is  minbndsum,  ^ 0 ^ 

where 

<Ji&^0)(u,v)  -  max(u  +  v  -  1,0)  , 


for  all  u,v  e  [0,1]  ,  all  we  have  to  do  is  verify  (•••) 

9  =  *  <°>  : 

Pick  any  three  points  x,y,z  e  X  . 

Consider  g(x.y)  +  g(y,z) 

=  Pr ( x  €  S.  y  e  S)  +  Pr(y  e  S,  z  e  S) 

=  ( Pr ( x  e  S  |  y  €  s)  +  Pr ( z  €  s  |  y  e  S))-Pr(y  e  s) 

5  '  aor^io  ]<an  +  aio  +  an  +  aoi,-pr(Y  €  S)  ' 

such  that  a  is 
fixed,  with 

an+aio+aoi+aoo=1. 


where 


a  =  Pr(x  e  S,  z  e  S|y  e  S) 

a1Q  ^  Pr ( x  «  S,  z  <  S  |  y  e  S) 

aQ1  3  lr(x  <  S,  z  e  S  i  y  €  S) 

aQ0  ^  Pr ( x  «  S,  z«S|yeS) 

=  ( 2a: :  +  1  -  au)'Pr(y  «  S ) 
=  (au  +  1 )  •  Pr ( y  €  s)  . 


Hence 
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g ( x , y )  +  g ( y , z )  Z  (Pr(x  e  S,  z  e  S  |  y  e  S)  +  1)-Pr(y  e  S) 

=  Pr(x  €  s,  z  e  s,  y  e  s)  +  Pr(y  e  S) 

<.  Pr(x  e  S,  z  €  S)  +  1  . 

Thus , 

$^.0)  (g(x,y)  ,  g ( y ,  z )  )  s  Pr(x  €  s,  z  e  S)  =  g(x,z)  . 

If  <|>&  =  a  (i.e.,  min)  ,  a  solution  set  of  g's  satisfying 
( • • • )  is  given  by 

g  ( x ,  y )  =  F(x)  a  ¥-{y)  , 

for  any  (X,y)  €  Ob^fFuzfH))  . 

If  4>&  is  any  Archimedean  t-norm,  then  since  3  unique 

h  :  [0,1]  -♦  [0,-k»]  monotone  decreasing  with  h ( 1 )  =  0  , 
with 

<?&(u,v)  =  h  1  (min(h(u)  +  h(v),h(0))) 
for  all,  u,v  e  [0,1]  (see  section  2.3.6),  whence  (•••)  becomes 
h-1(min(h°g(x,y)  +  h°g(y,z),  h(0)))  <  g(x,z) 

and  hence 


m i n ( h° g ( x , y )  +  hog(y,z),  h ( 0 ) )  <  hog(x,z)  . 

In  particular,  If  <f>&  is  strict  Archimedean,  h(0)  =  +«> 

Then  the  above  relation  is  equivalent  to  h°g  being  a  pseudometric 
d  over  X  ;  conversely,  given  any  pseudometric  d  over  X  , 

g  -  h  1od  will  satisfy  (  •  •  •  )  . 

Finally,  note  that  it  easily  follows  from  the  property  of  t- 
norms  that  if  g,g‘  are  any  solutions  for  (•••)  ,  then  so  is 

$&ig .g ' )  • 

Thus  again,  Higg(H)  -  or  more  generally  -  the  extension 
Higg(H;  ,  <f>Qr)  defined  through  (•••)  plays  a  natural  role  in 

extending  fuzzy  sets  to  a  joint  membership  definition  (v,  "or",  is 

similarly  replaced  by  $Qr  )  . 

The  natural  questions  we  thus  pose  are: 

(1)  What  are  the  properties  of  Fuz  ( H ;  <$)& ,  <Por’>  when  a  =  min 

is  everywhere  replaced  in  Fuz(H)  by  ,  an  arbitrary  but 

fixed  t-norm? 

(2)  What  are  the  properties  of  Higg(H,  •I'qj.)  ? 
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5 . 4  Entropy  of  Random  Set  Systems  and  Coverages. 


Another  basic  problem  associated  with  generalized  sets  or 
possibility  theory,  or  more  generally,  multiple-point  coverage 
functions  and  random  sets  is  the  determination  of  meaningful  cri¬ 
teria  for  ordering,  in  some  sense,  random  sets  in  terms  of  "best" 
representing  coverage  functions.  Among  natural  criteria,  should  be 
mentioned:  expected  volume  (or  expected  cardinality  in  the  finite 
case)  and  entropy. 

Denote  by  fg  the  coverage  function  of  a  random  set  S  on  an 
arbitrary  space  X  .  Recall  Robbins'  formula  (section  4.4): 

(i)  If  X  s  Rn  ,  then  under  suitable  measurability  conditions, 
we  have,  for  any  mil. 


JR 

.  .  ,  X  )  ' 

nr  . 

l 

(ii) 

If  X  is  a  finite  space, 

E (  (  card (  S  )  )m)  =  • 

(Xjf • . 

.  ^ 

.  ,X  €  X ) 

m 

Thus , 

for  any  f  €  * (m) (X) 

(S)k) 

=  constant,  k  =  1,2, 

.  .  .  ,m  , 

m 


f  (x  , . . . ,x  )  . 
SI  m 


from  y^mj(X)  .  Thus,  another  criterion  must  be  sought 

sensitive  to  variable  S  e  y  .(X)  . 

(m) 


S  chosen 
which  is 


Remark . 

Robbins'  formula  was  later  independently  rediscovered  by  Pratt 
[205],  and  in  turn  used  by  Hooper  [119]  in  demonstrating  that  most 
common  figures  -  of  merit  -  for  randomized  test  procedures  and  ran¬ 
domized  confidence  sets  involv-  the  random  sets  determining  these 
procedures  only  through  their  one  point  coverage  functions.  (See 
section  5  5 . ) 

We  proceed  now  to  discuss  maximal  and  minimal  entropy  problems 
for  discrete  spaces. 


A.  Fundamental  maximal  entropy  theorem  for  discrete  spaces. 

(See,  e.g.,  Jaynes  [123],). 

Let  X  be  any  fixed  discrete  space.  For  any  probability  func¬ 
tion  P  e  ?r(X)  ,  the  class  of  all  probability  functions  over  X  , 

define  the  entropy  of  P  by 

Ent(P)  =  Z  -log  P(x)-P(x)  . 

X€X 


o 


Let  J 


be  any  finite  index  set. 
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g  =  (g  . ) l€ ,  (column  vector  of  functions)  f 
3  3  Jo 


U.  /  \ 


[  column  vector ) , 


A  s  (A  .)  .  (column  vector)  , 

3  Jej0 


where  for  each  j  e  Jq  ,  :  X  -»  R  is  an  arbitrary  fixed  function, 

i j  €  r  arbitrary  fixed  constant. 

Next,  (assuming  convergence)  define 

9  (X)  =  (P  |  P  €  3>r  ( X)  &  z  p(X)-g  (x)  =  7.-  ,  j  €  J  )  . 

(g,,)  xeX  J  J  ° 


sup  Ent ( P )  =  loy  (G{ A ) )  -a-7  , 

Pe"(g,.)(x) 

occurring  for  P  =  P.  .  ,  where  P.  .  is  determined  by: 

(g.y)  (g.y  )  1 

T 

P(g  A  j (x)  =  ( 1/G{ A ) ) - eA  'g(x) 

G  (  a  )  =  z  eA  '  g(x)  ,K«X, 

X€X 

and  a  is  a  constant  vector  determined  by 

Z  g(x)-eAT-g(x)  =  ,  . 

X€X 

or  equivalently, 

i  ■  log  G(a)/<9Aj  =  i  ,  all  j  e  .  (i 

Let  X  S  Rn  ,  X  discrete. 

In  particular,  let  m  be  such  that  m  >  1  fixed  and  choose 


J  =  U  J,,  ,  . 
°  Jc-1  (k) 


where 


J(k>  =  (1'2' 


■n>  =  <  (  . 3k)  I  . ik  €  {1 . n)  }  , 


k=  1,2 . m  . 


For  each  j  =  (^ . Jk) 


k  =  1,2, . . . , m  ,  define 


g  j  by. 
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g  .  (x)  =  x  •  x  , 
J  Jk 


for  any  x  = 


x. 


€  X  ,  X.  €  IR  . 


This  ’  r  the  first  m-mo,u«n'.t  constrain!:  problem  with  solution 
for  maximal  entropy  being 


P(g.7 ) (x)  = 


constfA )-e 


[5  a  .  x .  +•  •  +  3 


lSj. - ,j  in 


A  .  .  X  .  X  . 

VV  V  V 


m 


with  a  determined  from  eq.  (1). 

In  particular,  let  X  =  {a  +  j-4  I  j  =  0 , l , . . . ,n-l }n  ,  a  lat¬ 
tice  structure. 

For  m  =  1  ,  we  obtain 


J  q  —  “  {!,•••» n}  , 


G( a )  =  n 
k=l 


,  A .  A  •  r 
aA  k  , 

k  e  -  1 


Ak^  -  1 


and  hence  eq.  (1)  reduces  to 

a  j  =  (1/4)  log  (r^)  ,  j  =  1 , . . . , n  , 

where  each  r  is  determined  from 


A-  rr  .  A-  t  . 


3  - 


=  i  .  -  a  ;  j  =  1 ,  .  .  .  ,  n  . 


r  .  - 1  r  .-1 

3  3 


Specializing  further,  suppose  a  =  0  ,  A  =  l  ,  and  r  =  2  , 

i.e.,  X  =  { 0 , 1 } n  ,  m  =  1  and  n  2  1  as  before.  Then  here 


G(A)=  n  (e  ~  +  1 )  , 

k=l 

rj  =  7  j /  ( 1  ”  7  j  *  ’  ^  =  1 . n  ' 


implying 
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n 

G  ( A  )  =1/77  (1  -  i  .,) 

3-1  3 


and  finally 


P,  ,(x) 

ig.i) 


I  I  K  3-  d 

3=1 L  J 


i  . 
1 


all  x  «  X  . 


(For  a  *  0  ,  A  =  1  ,  r  =  2  ,  but  m  k  2  ,  the  solution  form 
remains  as  in  eq.  (1)  with  a  no  longer  in  simple  form  for  the  a's 
as  the  case  m  =  1  ,  because  of  the  relatively  complicated  form  of 
G(A)  .) 

The  above  special  case  thus  yields  the  same  result  S  =  T  for 
the  problem 


sup  ( Ent ( S ) ) 

S€  *  ( 1 )  < 9 ) 

where  g  =  <}>A  and  P^  ^  j  is  the  probability  function  of  random 
variable  . 

Other  maximal  entropy  problems  involving  m  point  coverages 
may  be  similarly  approached. 


B.  Case  for  finite  base  space.  (See  also  [89].) 

Consider  now  the  case  where  the  base  space  X  is  finite. 
Recall  that  the  entropy  of  a  random  set  S  on  X  is. 

Ent(S)  =  -  ^  Pr ( S  =  C)  log  Pr(S  =  C)  , 

Ce>  ( X ) 


Theorem  10 

For  any  A  e  7 (X)  , 

sup  ( Ent  ( S )  )  =  J  (-<*>  (x)log  <f)  (x)  -  (1  -  $  (x))log(l  -  <f>.(x))) 
S€»  (A)  X€X  A  A  A  A 

occurring  uniquely  for  S  =  T(A)  . 


Proof : 

Either  use  the  fundamental  information  theory  inequality  ap¬ 
plied  to  $s  -  which  is  equivalent  to  a  specialization  of  the  well- 

known  result  that  given  any  marginal  probability  distributions,  the 
joint  probability  distribution  function  maximizing  the  entropy 
corresponds  to  the  marginal  random  variables  being  statistically 
independent,  or  specialize  the  exponential  family  characterization 
of  maximal  entropy  (see  part  A)  to  this  case. 
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Remark . 

The  above  Theorem  10  is  also  well-known  in  the  theory  of  sample 
surveys;  see  Hajek  [103].  Specifically,  in  sampling  from  a  finite 
population  X  ,  a  sampling  design  is  a  probability  distribution  Q 

on  the  collection  of  all  2n  subsets  of  X  (assuming  card(X)  =  n)  . 
In  other  words,  Q  is  the  induced  probability  measure  of  a  random 
set  S  on  X  .  The  inclusion  probabilities  (first  and  second- 
order)  are  precisely  the  values  of  the  coverage  functions  of  S  , 
i  .  e  .  , 


Pr ( x  e  s) 


I  Q(A) 
fx  e  A/ 

La  e  »(x) 


] 


Pr ( <x,y)  £  S)  =  ^  Q(A> 

[t  2  (x,y)l 

La  e  »<x)  J 

The  entropy  of  S  is  a  measure  of  spread-out  for  sampling  probabil¬ 
ities.  For  the  choice  of  a  random  set  representation  (a  sampling 
design)  of  a  given  one-point  coverage  function  (first-order  in¬ 
clusion  probabilities),  one  is  in  favor  of  the  maximum  entropy 
principle  as  an  appropriate  criterion.  More  precisely,  given 
Pr(x  e  S)  ,  for  all  x  e  x  ,  the  random  set  S  which  has  maximum 
entropy  is  the  one  with  probability  measure  Q  on  9>(X)  given  by: 


A  e  g>  (X )  ,  Q(A)  =  x  Pr(x  e  S)  x  (l  -  Pr(y  e  S)) 
xeA  y«A 

(Poisson  sampling  design). 

Now,  motivated  by  Robbins'  formula,  consider  the  weaker 
constraint 


E(card(S))  =  ^Pr(x€  S)  =  i  , 
xeX 

(1  <  i  <  n  =  card(S))  ,  v  given. 

USjng  *-he  Lagrange  multiplier  technique  as  in  the  case  of 
random  variables,  we  obtain: 


Theorem  11 


The  solution  of  the  optimization  problem 


maximize 


subject  to 


-  5  Pr  (  S 
Ae? (X) 

^  Pr  ( x  e 
xeX 


=  A)  log  Pr ( S  =  A) 


S)  =  i 


is  given  by: 
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Pr(S  =  A)  =  — e  /5'Card(A)  ,  A  €  o-  (X)  , 
where  p  is  the  unique  solution  of  the  equation 

g*  (/>)  +  ‘>g(P)  =  o 

.  ..  v  -/J- card  (A) 

with  g(P)  =  )  e  r  . 

Ae?(X) 

C.  Minimal  entropy  problem. 

On  the  other  hand,  the  minimal  entropy  problem  here  poses  more 
difficulties.  It  can  be  shown  that  [89]: 

Theorem  1 2 

*  (X)  , 

( Ent ( S ) )  . 

) 


Let  card(X)  =  n  .  Then  for  any  A  e 

min 


inf  ( Ent ( S ) )  = 
S€3'1(A) 


(j  (2)  (SJer^A) 


Proof : 


By  Theorem  4(3),  the  minimization  problem  reduces  to  the  rou¬ 
tine  minimizing  of  the  sum  of  a  strictly  convex  function  (-x-log  x) 

I  2 ) 

of  linear  combinations  of  components  of  t  (S)  over  the  region 
^(A)  . 

Even  though  (see  remarks  (a)  and  (c)  following  Theorem  4  , 

2 '  ( S  rl(A))  e  r  (A)  ,  and  if  Z  (<f>  (x))  S  1  ,  S' (A)  is  well-  defined 

U  1  xeX  A 

with  7  (2)  (S'  (A)  )  e  r^(A)  ,  it  is  not  always  true  that  the  global 

minimal  entropy  occurs  for  either  one.  A  simple  example  illustrates 

this.  Let  n  =  2  with  X  =  (x^.Xj)  ,  A  e  ?(X)  with 

4>a(x1)  s  <|>A(x2)  '  Let:  h(x)  =  -X*  log  x  ,  all  x  ,  and  define 

yl  =  <^A(X1)  and  y2  =  ^A(X2)  ‘  Finally ,  let 

G ( A )  =  inf  ( Ent ( S ) )  .  Then: 


(i)  If  r( A) 


0  and  y^  >  1/2  ,  then 


G(A)  =  h( 


Yl> 


h(y2) 


+  h(  1 


-  y2> 


occurring  uniquely  for  S  =  S' (A)  . 

(ii)  If  either  r( A)  =  0  and  y2  <  1/2  or  r( A)  >  0  and 
k  1/2  ,  then 


) 
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G ( A )  =  h(y1)  +  h(y2  -  y:)  +  h(l  -  y2 )  , 

occurring  uniquely  for  S  =  Sy(A)  . 

(iii)  If  r( A)  >  0  and  y  <  1/2  ,  then 

G(A)  =  h( 1  -  yx )  +  h{ 1  -  y2)  +  h(yx  +  y2  -  1 )  , 

occurring  uniquely  for  S  =  S"(A)  ,  where  S"(A)  is  determined  by 

Pr(S" (A)  =  {Xj})  =  1  -  y2  ,  Pr(S" (A)  =  {x2>)  =  1  -  , 

Pr(S"  (A)  =  X)  =  r(A)  ,  Pr  ( S  '*{  A}  =  0)  =  0  . 

Extensions  of  the  entropy  problem  to  multiple  coverage  func¬ 
tions  and  general  spaces  X  have  yet  to  be  addressed. 

5 . 5  Randoaized  tests  and  confidence  region  procedures. 

Consider  the  following  extensions  of  Lehmann  [154]: 

Let  Y  £  Rm  be  a  data  or  sample  space,  X  £  Rn  a  parameter 
space,  (vl0)0ex  a  famiiY  of  data  random  variables  over  Y  with 

corresponding  well-defined  probability  distribution  functions 
(fit,a,Pr)  be  a  fixed  non-atomic  probability  space  and 

=  {S  |  S  :  h  -»  ?( Y)  ,  S  a  random  subset  of  Y) 

and  similarly  denote  *(X)  and  d(X  x  Y)  as  the  classes  of  all 
random  subsets  of  X  and  X  x  Y  ,  respectively. 

The  class  of  all  randomized  confidence  set  procedures  is 
denoted  by 


* Pe  1  eex  ' 
Let 

denote 


*(Y) 


*  =  (R  |  R  : 

and  the  class  of  all 
given  by 

.a  =  (A  |  A  : 

Let 

*  -  ( (Hq(0 )  , 
be  the  fixed  family  of 

Hq(9)  =  (8) 


Y  -*  »(X)  (measurable, 
randomized  acceptance 

X  -»  *(Y)  (measurable, 

H : ( 0 ) )  |  0  €  X) 

hypotheses,  where  for 
H  (0  )  =  X  -)  (0)  £  X  . 


etc .  )  } 

region  procedures  is 

etc  .  )  }  . 

any  0  «  x  , 
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Let  (V|fl)._v  be  a  family  of  observation  random  variables  over 

Y  (i.e.,  ( V | 8 )  :  n  -  Y  (measurable,  etc.)  with  corresponding  pro¬ 

bability  distribution  functions  (pg)g€x  ' 

There  is  a  bijective  correspondence  between  7  ,  the  class  of 

all  randomized  decision  or  test  procedures  (test(8))  v  concern- 

ing  K  ,  and  a  ,  given  by 

f  Decide  HQ(e)  iff  observe  y  e  A(e) 

test(8)  |  Decide  H^e)  iff  observe  y  e  Y  w  A(e)  , 

for  all  8  €  X  . 

There  is  a  bijective  correspondence  between  at  and  a  and 
hence  o'  ,  given  by: 

(i)  Given  any  Rea,  define  An  e  a  by,  for  any  e  e  x  , 

K 

Ar(8)  ={y|y€Y&ee  R(y)}  . 

Then,  if  one  defines  SR  e  a ( x  x  Y)  by 

SR  =  <(9.y)  |  e  €  X,  y  €  Y  &  8  e  R ( y ) }  , 

it  follows  that  for  all  8  e  X,  y  e  Y  , 

S_ ( 8  ,  •  )  =  section  of  S_  at  8 
K  K 

=  (y  I  y  €  Y  &  (8 ,y)  €  SR>  =  Ar ( 8 ) 

S_(-,y)  =  section  of  Sn  at  y 

K  K 

=  (8  |  8  e  X  &  (9,y)  c  SR)  =  R(y)  . 

(ii)  Given  any  A  e  a  ,  define  Rft  e  »  by,  for  any  y  e  Y  , 

RA(Y)  =  (e  |  e  €  X  &  y  e  A(e ) )  . 

Then,  if  one  defines  e  a(X  x  Y)  by 

SA  =  (ft  ,  y)  |  8  €  X,  ye  Y&ye  A  ( 8  )  }  , 

it  follows  that,  for  all  8  e  X  ,  y  e  Y  , 

SA ( 8 , - )  =  section  of  SA  at  e 

=  (Y  I  Y  «  Y  &  (8  ,y)  e  SA)  =  A(8 ) 

SA(- ,y)  -  section  of  SA  at  y 

=  (8  |  8  e  X  &  (8 ,y)  e  SA)  =  Rft(y)  . 
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Call  any  pair  (  A ,  R  )  or  (A_,R)  compatible. 

A  K 

Conversely,  it  follows  that  S(X  x  y)  generates  all  possible 
compatable  pairs  of  randomized  confidences  set  procedures: 

Let  S  s  a(x  x  Y)  be  arbitrary. 

Define  Ag  €  a  and  Rg  e  a  by,  for  all  s  e  X  ,  y  e  y  , 

A  (•)  =  section  of  S  at  9 

=  {y  |  ye  Y  &  (9,y)  «  S)  , 

R  (y)  -  section  of  S  at  y 

={9  |  o  €  x  &  (s,y)  €  S)  , 

and 


Thus,  for  any  S  e  a(X  x  Y)  ,  there  is  test„  =  (test„(9)) 

O  b  9  €  A 

determined  by  randomized  acceptance  region  procedure  Ag  corres¬ 
ponding  to  confidence  set  procedure  Rg  such  that,  for  all  9  e  X  , 

Pr ( Decide ( for  testg ( 9 ) ) HQ ( 9 )  |9)  =  1  -  Pr(Decide  H  (e)  |  9) 

=  Pr ( V  €  As(9 )  |  9)  =  J  dPfl (y) 

Y€  As ( 9  ) 

=  Pr(9  e  Rg(V)  |  9)  (two-stage  randomness  due 

to  V  and  S  )  , 

In  particular,  these  relations  show: 

(i)  For  any  0  <  a  <  1  :  testg  e  cr  is  an  a  -  level  test 

procedure  ,  i . e  .  , 

Pr  ( Decide  ( for  testae))  H1(9)  |  9)  2  a  ;  for  all  9  e  X 
iff  Rg  is  an  (1  -  a)-level  confidence  set  procedure,  i.e., 

Pr ( e  e  Rg ( V)  |  9)  i  1  -  a  ;  for  all  9  €  X  . 

(ii)  Let  0  <  a  <  1  and  dQ  e  X  .  Then  testg (9  )  is  a 

uniformly  most  powerful  a-level  test  for  HQ(90)  vs.  1^(9  )  iff 
Rg  ,  among  all  (1  -  a) -level  confidence  set  procedures  R 

minimizes  Pr(9Q  €  R(y)  |  9)  ,  simultaneously,  for  all  9  e  H 1 ( e  Q )  . 

(See  Lehmann  [154]  for  other  properties.) 

In  addition,  Robbins'  result  [219]  (also  independentl 
rediscovered  by  Pratt  [205])  applied  here  yields  (see  also  sectio 
4.4): 


3  < 
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(S) 


Ev(vol  (RS(V)))  =  J  J  de  dPr° ( Rg ( V ) ) ~ 1 ( B ) 

Be  b  e €  b 
n 

=  J  J  d  Pr° ( Rg ( V ) ) ~* ( B )  de 
eex  Bec{e)(Bn) 

=  J  Pr ( 8  e  Rg(V)  |  e)  de 
sex 

=  J  Pr (V  e  A S(s )  |  9)  de 
sex 

=  J  Pr ( Decide  H  (9)  |  e )  ds  . 
sex 

Also,  note  the  evaluation  of  the  one-point  coverage  function 
:  X  x  Y  -*  [0,1]  of  S  at  any  e  e  X  and  y  e  y 

<P(S)  (9  ,y)  =  Pr (  (9  , y )  e  S) 

=  Pr ( V  e  A s(9)  |  v  =  y)  =  Pr(y  e  Ag ( e ) ) 

=  Pr ( e  e  RS(V)  |  V  =  y)  =  Pr ( 9  e  Rgty)) 

=  Pr ( Decide  (for  test s(9)}  HQ ( 9 )  |  V  =  Y  :  ®)  • 


Note  also  the  reductions  of  all  of  the  above  results  when 
S  =  C  a  Xx  y  ,  i.e.,  for  the  non-random  case. 

Thus,  in  this  notation,  for  all  8  €  X  ,  y  e  y  , 

E(voln(Rs(V)  |  V  =  y)  =  J  <t>(s)(8,y)  d9  ' 

9eX 

E(voin(Rs(V)))  =|  J  (9  ,y)  dP8(y)  de  , 
e  e  X  ye  y 

Pr ( Decide ( for  testg(e))  Hq ( e )  |  9)  =  Pr(V  e  Ag (8)  |  8) 

=  Pr (9  e  RS(V)  |  9 ) 

=  J  <?>s(e  ,y)  dP0  (y)  . 

ye  y 

Then  define  the  joint  admissibility  measure  map 

9  Yv  V 

P  :  7  ([0,1]  )  for  3t  by,  for  any  S  e  a(X  x  Y)  and  hence 

2 

any  testg  e  7  ,  as  the  map  p(S)  :  X  x  Y  -»  [0,1]  where  for  any 

e  e  x  and  any  y  e  y  , 


A 
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P(S)I9  ,y) 

=  (Pr ( Decide ( for  testg( 8 ) ) ^ ( 9 )  |  9)  ,  E ( vol^ ( Rg ( v ) )  |  V  =  y)) 

Partially  order  7  by,  for  any  S2<S2  €  x  ' 


test,,  = 

-  1 

test 

b2 

iff 

P(Sj) 

=  p(S2) ,  a . e . , 

test,  s 
bl 

test- 

b2 

iff 

P(Sj) 

<  p(S2)  ,  a . e . , 

test-  < 
bl 

test- 

b2 

iff 

P(Sj) 

*  P(S2)  , 

e  X  and 

for  a . 

,  e . 

y  «=  Y 

with  strict  inequality  holding 

for  either  the  first  component  at  some  9  €  X  or  for  the  cecond 
component  at  some  set  of  y  e  y  of  positive  Lebesgue  measure. 

Then  the  admissible  tests  in  7  a~e  those  which  are  not  less 
than  some  other  test  in  7  .  Other  criteria  may  be  designated  for 

comparing  the  worthiness  of  tests.  In  any  case,  these  comparisons 

may  be  carried  out  essentially  through  use  of  the  functions  $  g 

and  not  necessarily  directly  using  the  actual  corresponding  tests 
e  7  .  Joshi  [125]  was  apparently  one  of  the  first  to  recognize  this 

principle,  where  he  showed  the  admissibility  of  the  usual  confidence 
set  procedure  for  the  means  of  one-and-two-dimensional  Gaussian 
distributions.  (For  three  or  more  dimensions,  the  procedure  is 

inadmissible  as  Joshi  showed  in  an  earlier  paper.)  Joshi  also 
claimed  ([125])  that  given  any  $c  :  X  x  Y  -*  [0,1]  in  the  form  of 

a  simple  or  elementary  (a  countable  weighted  sum  of  membership 
functions  of  ordinary  sets)  function  corresponds  to  a  randomized 
confidence  set  procedure,  i.e.,  an  Sc  a(X  x  Y)  and  that  any 
(suitably  measurable)  <t>c  represents  the  limit  of  a  sequence  of 

such  procedures.  (See  G.  S.  Goodman  [93]  for  so,,  e  related  results 
from  a  fuzzy  set  viewpoint.)  Carrying  this  idea  further,  Hooper 
([118],  [119])  showed  that  given  any  suitably  measurable 

$c  :  Xx  Y-  [0,1]  ,  there  was  at  least  one  random  set  S  e  a(X  x  Y) 

whose  one  point  coverage  function  was  <p  :  namely  S  =  S ^ ( C )  ! 

Hooper,  clearly,  was  unaware  of  the  extensive  literature  devoted  to 
the  basic  one  point  coverage  problem,  but  indeed  did  use  the  funda¬ 
mental  transform  S^  in  reconstructing  random  set  S  whose  one 

point  coverage  function  was  specified.  (See  the  earlier  part  of 
this  section  for  various  results  in  this  problem.)  Furthermore, 
Hooper  ([119],  p.  550)  also  developed  independent  of  any  fuzzy  set 
literature  connections  (as  extensively  described  in  this  text)  in 
effect,  that  the  class  a(X  x  Y)  partitions  in"o  one  point  coverage 
equivalence  classes  indexed  by  all  possible  distinct  <£„  e  7(X  x  Y)  ! 

(Hooper  [239],  [240]  also  developed  important  natural  generaliza¬ 

tions  of  sufficiency  and  invariance  techniques  for  confidence  set 
procedures  in  terms  of  the  one  point  coverage  function  represen¬ 
tations  .  ) 

Thus  we  have  a  basic  tie-in  between  some  baric  fuzzy  set-random 
set  relations  and  results  in  classical  hypotheses  testing  ^nd  confi¬ 
dence  interval  theory! 
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In  summary: 

(a)  There  is  essentially  (up  to  measurability  considerations)  a 
bijective  relationship  between  all  random  subsets  of  a  parameter 
space ,  cross  a  data  space,  X  x  Y  ,  all  randomized  test  procedures 
for  all  simp’ e  null  hypotheses  vs.  complement  alternatives  of  the 
parameter  space  and  all  compatible  pairs  of  randomized  acceptance 
regions  and  confidence  set  procedures.  A  similar  relation  holds  for 
all  ordinary  subsets  of  X  x  Y  and  all  non-randomized  tests  and 
confidence  set  procedures. 

(b)  Most  criteria  of  interest  involving  such  tests  or  equivalently 
confidence  set  procedures  may  be  formulated  in  terms  of  the  one 
point  coverage  function  of  random  subsets  of  X  x  Y  and  (hence  by 
the  use  of  the  canonical  transform  Sy  ,  etc.),  in  effect  (up  to 

suitable  measurability  conditions),  in  terms  of  all  fuzzy  subsets  of 
X  x  Y  only.  For  example  Robbins'  result  [207]  shows  expected  vol¬ 
umes  of  a  randomized  confidence  set  procedure  are  the  integrals  of 
fuzzy  set  membership  functions  -  the  one  point  coverage  functions  of 
the  random  subsets  of  X  x  Y  . 
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CHAPTER  6 


ISOMORPHIC-LIKE  RELATIONS  BETWEEN  RANDOM  SETS 
AND  GENERAL  LOGICAL  SYSTEMS. 


Mappings  Sy  and  T  (Chapter  5}  are  special  one-argument  cases 

of  multiple  argument  mappings  S  from  cartesian  products  of  all, 

generalized  subsets  of  given  spaces  to  cartesian  products  (as  well- 
defined  random  objects)  of  all  random  subsets  of  the  same  spaces 
which  marginally  produce  one  point  coverage-equivalences  between  the 
generalized  and  random  sets  involved.  Again  (see  [90],  [88]), 

these  are  called  choice  function  families.  Next,  let  X. ,...,Xn 

and  X  be  any  n  +  1  given  spaces  and 

*  :  9  (X.)x.  •  x»(Xn)  -*  9  (X) 

any  given  n-ary  ordinary  set  operation.  Often  (but  not  necessarily 
so,  in  general),  *  is  the  power  extension  of  the  mapping 

*  '  :  X  x  •  •  x  X  -  X, 

1  n 

where  for  any  «  ’*Xj  )  -  j  =  ■*••••  <n  , 

*(C1 . Cn)  =  (*'(x1 . xn)  I  xj  €  cj  ,  j  =  1 . n)  . 

Also,  let 

*  :  9  (X:  )x-  •  (Xn)  -*  9  (X) 

be  any  given  n-ary  generalized  set  operation  (defined  through  the 

membership  functions,  of  course). 

Finally,  let  5  =  (S.)._.  be  a  choice  function  family  of 

jj— i,...,n 

random  set  maps,  i.e.,  for  any  space  X  , 

S j  =  S  :  *(X)  -  *(X)  , 

such  that  for  any  spaces  X^  ,  and  any  A  ^  e  sf  ( x  ^ )  ,  j  =  l,...,n  , 

the  random  sets 

Sj(Aj)  :  B  -  *(Xj)  ,  j  =  1 . n  , 

with  respect  to  some  initial  probability  space  n  ,  yield 
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(Sj(Aj})j  =  l . n  : 

where  marginally 


n 

x  »(X,) 

j  =  l  ° 


as  a  well  defined  random  set, 


Sj(Aj)  “  Aj  '  j  =  1 . n  ‘ 


Then  we  say  that  the  triple  (*  :  *,S)  is  in  an  isomorphic-like 

relation,  also  called  a  w-hom  ( weak-homomorphism } ,  iff 


* ~n  -  S(*V  ^  *S(An)  , 

i.e.,  for  all  A^  e  5(Xj)  ,  j  =  l,...,n  ,  x  e  X  , 


<P;A  (*)  ~  4>;(A  (*)  =  Pr(x  €  S(‘V  > 

~n  j  J=1  ,.9n 


=  Pr ( x  €  *  S { A  ) )  =  Pr ( x  «  *  ( S  .  ( A  . ) ) )  . 

~  “n  ^  =  1,..^,^ 

Note  that  in  the  above  equation,  it  is  assumed  that  *  S(A  ) 

is  a  well  defined  random  set.  More  on  this: 

Using  standard  procedures,  it  is  easily  shown  that  if  I  is 
any  index  set, 

S(A  )  3  (S(A  )  )  .  :  a  -  x  3>(X.) 

I  j  j«I  3 


where  («,ai,Pr)  is  some  fixed  probability  space  and  for  all 
wen,  StAjMw)  =  (S(Aj)  (w)  )j€l  ,  S(AX)  will  be  (a.G)- 

measurable ,  where  e  =  o(  x  o(e  (»  (X.))))  ,  inducing  probability 

jel  3 

space  (  x  ?(X.),  G,  w)  ,  y  =  Pr°S(A  )  *  .  Furthermore,  if 
jel  J 


*  :  x  -MX.) 

j€l  J 
a-algebra  ‘J 


-»  tp  ( Y )  is  any  ( 6  ,7  )  -measurable  mapping  for  some 
over  ‘J>(Y)  ,  then  *°S(Aj}  :  n  -  ?(Y)  will  also  be 


measurable.  In  particular,  since  ordinary  set  operations  *  in¬ 
cluding  u  ,  n  and  C  are  measurable,  (and  continuous  relative  to 
the  natural  topologies  involved),  it  follows  that  *°S(A^)  for 


*  =  u  ,  n  ,  c  ,  etc.,  are  all  measurable  and  hence  random  subsets  of 
appropriate  Y’s. 

Define  for  any  A  €  7  (X)  here  the  complement  of  A  ,  CA  (or 
X  -i  A  ,  etc .  )  by 
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$  (x)  =  1  -  <|>a(x)  ;  all  x  e  X  , 

CA 

and  similarly,  if  f  :  X  -*  Y  is  any  c^Jinary  function,  for  any 
B  €  5(Y)  ,  x  €  X  , 

<?> — ,  (x)  =  <J>R(f  (x)  )  . 

f  (B) 

Then,  it  can  be  shown  that  if  S  is  any  choice  function,  then 

(C  ;  C  ,  S )  is  a  w-hom  and  if  f  :  X  -»  Y  is  sufficiently  measurable, 

then  ( f_1  ;  f  1,  S)  is  a  w-hom.  If  fuzzy  set  system  5  is  non- 
Archimedean,  and  S  is  any  choice  function,  then  f  and  projection, 

proj,  induce  modified  w-hom 1 s ,  where  »  is  replaced  by  »  ,  Further¬ 


more,  if  (x,  x,  S)  is  a  w-hom  for  fuzzy  set  system  5  ,  then  for 

any  A  e  5(X),  x.  e  X.  ,  x,  e  X.  ,  the  pair-wise  correlation  of 

3  J  K  K 

random  set  coverage  -  COf-rel($s  (A  <f>s  {A  j(k.))-is: 

j  j  k  k 

(i)  maximized  for  the  evaluation  5  =  5Q  , 

(ii)  made  zero,  i.e.,  uncorrelated,  for  5  =  5  . 


(iii)  minimized,  i.e.,  made  most  negative,  for  n  =  2  ,  for 
7=5 


It  can  be  shown  ([91],  Theorem  5.1)  that  if  (x  ,  x  ,  S)  is  a 

w-hom,  then  (F  ,  t  ,  S)  Is  also  a  w-hom  (for  the  same  S)  iff  the 

corresponding  fuzzy  set  system  is  semi-distributive.  (See  section 
2.3.6  for  notation.) 

Let  us  consider  now  some  particular  classes  of  choice  function 
families  and  associated  w-hom' s. 


First,  let 


t>  be  any  J  -copula  (see  [91])  and  let  stochastic 

w  O 


process  U  =  (U.)  .  be  such  that  each  U.  is  a  random  variable 
J  J€j0  J 

uniformly  distributed  over  [0,1]  with  joint  probability  distribu¬ 
tion  function  furnished  by  <t> „  : 

& 

Pr(Ul  5  X1 . Un  5  V  =  VX1 . V  ’ 


for  any  xx . xn  €  [0,1]  .  Then  it  follows  that  U  induces 

choice  function  family  S  =  ( SM  )  .  _  where  for  DeMorgan  fuzzy  set 

~  uj  ■>  Jo 

system  5  =  (l-(-),  <pQr)  ,  it  readily  follows  that  (*  ;  *  ;  S^) 

is  a  w-hom  for  *  =  x  ,  n  ,  t  ,  u  .  Also,  for  any  C..  e  5(X^)  , 
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Aj  «  * (Xj)  ;  j  €  K  S  Jq, 

Pr(  &  (C.  £  S  (A.)))  =  Pr(  &  (C  £  4>._1  [U  .  ,  1  ]  )  ) 
J«K  3  Uj  3  j«K  3  j  3 

=  Pr{  &  (U  <  inf  $  (x)  )  ) 


j«K 


„  A  . 
xeC  .  1 

3  J 


=  <t>&  (inf  (}>A  (x)  )  . 
j«K  “Cj  J 

Note  that  when  =  min  ,  we  obtain  Sy  =  Sn  and  all  of  the 


U 


w-hom  results  remain  valid.  Also,  for  <p&  =  prod,  U  becomes  a 
statistically  independent  uniform  [0,1]  process  and  Sy  likewise 

induce  statistically  independent  Sy  (A.)'s  and  w-hom ' s  for  unions, 

j  3 

intersections,  and  projections,  etc. 

Next,  let  9  €  (semi-distributive  class)  and  for  any 
A  =  (Aj)j€j  ,  Aj  €  sf  ( X)  ,  for  all  j  e  3  Q.  let 

( A )  =  (<^T(A)  <X)  }X€X  =  (VA,&,x}x€X  =  VA  ,  & 


be  a  zero-one  process  such  that  each  V.  „  is  statistically 

A ,  &  ,  x 

independent  with  respect  to  different  x  e  X  and  where  each 

d 


V 


( V  ) 

A , & , x  A j , & , x , j  je 


is  a  zero-one  process  finitely  consistently  generated  by  the  rela¬ 
tions 


Pr (  &  (V 
j€K 


A  ,  &  ,  X  ,  j 


=  D 5  =  4>&  (1>a  (x) )  ;  x  £  i  (JQ) 

jex  J 


Each  Tj(Aj)  ,  uniquely  corresponds  to 

j  <  A j )  =  =  (VAj(&,x, j}x€X  ' 

a  T-type  random  subset  of  X  . 

For  any  e  9  (X)  and  A..  €  ?(X)  ;  j  €  K  £  JQ  , 


Pr  (  &  (C.  £  T  .  ( A . ) 


j«K 


3  3 


TT  (  (  <PA  (X)))  , 

x  €jeKCj  (k)xeCk,k€K}k 


implying  that  (n  ;  n  ,  T)  is  a  w-hom. 
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Another  family  (see  Hohle  [116]  and  [117]  -  the  latter  pre¬ 

senting  very  general  results)  of  choice  functions  may  be  generated 
by  choosing  any  fuzzy  set  system  s  =  ( 1  —  ( -  )  ,  $&,  <|>  )  «  /e  (the 

semi-distributive  class  -  see  earlier  remarks,  recalling  that 
examples  of  these  fuzzy  set  systems  include  ( 1 — ( -  )  ,  prod,  prob- 
sum)  ,  ( 1  —  { -  )  ,  min,  max)  and  all  ordinal  sums  of  these  (section 

2.3.6)  . 


Then  for  any  fixed  collection  of  spaces  X  =  (X.)  .  , 

~  3  3=1 . n 

and  a  collection  of  fuzzy  subsets  A  =  (A.)  .  ,  frcm  X 

j  3=1. 


first  a  doubt  measure 


j ' 3=1 , . . . ,n 
is  determined  over  the  class  of  all 


subsets  C  of  X  by  (first  evaluating  for  finite  sets  and  extend¬ 


ing) 


Vcr 

j  =  l . n 


<V<x3))  • 

3  J 


Similarly,  define  plausibility  measure  fj A  .  by 

~  '"or 


(^A  .  (xj ) )  • 

J 

Lj  =  i ,  .  .  .  ,n. 

Then,  by  previous  results,  there  are  uniquely  determined  joint 
random  set  families 


A,4> 


<C)  9 


or 


'or 


W 


§fifA)  = 


( S ^ ( Aj , < 


}  1  J  =  1  , 


,  n 


and 


S  (A)  =  (S  .  (A  .  ,<J>  )) 

~or  -  33  or  ' 

such  that  not  only  for  all  A 


j=l , . . . , n 
fuzzy  sets 


§&(A)  *  A  ;  Sor(A)  «  A  , 


yielding  S&  and  SQr  as  choice  function  families,  but  also  such 
that  for  all  combinations  of  and  (end  cartesian  product 
and  sum  -  and,  in  particular,  intersection  and  union) 


Pr(  comb  (&,or)(C  .  £  S.  (A.  ,  cf)  )  )  ) 

all  j , k  3,K  3  3  & 


comb 
all  j  ,  k 


V*or)(*& 


V!xj})) 

xJ€CJ,kJ 


j  >  k 


;  all 
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with  a  similar  relation  holding  for  SQr(A)  ,  with  respect  to  in¬ 
cidences  in  place  of  inclusions.  This  implies  isomorphic-like 

relations  between  multiple  argument  fuzzy  f  and  ordinary  set 
operation  f  :  X  -»  Y  extended  to  the  respective  power  classes, 

where  f  as  obtained  by  the  extension  principle  discussed  earlier, 
i.e.,  for  all  y  «  Y  and  all  (fuzzy)  A  , 


*f(A)(Y)  = 


(<Xi> 


«  r  (y  f)  j  =  l ,  •  •  •  ,n 
j  =  1 ,  .  .  .  ,  n  ' 


(<K  (x.)  ))  . 

ti  2 


Thus,  for  all  A 


f  (A)  *  S&(  f  (A)  ;  <|>&)  *  f  (S&(xA;<j>&)  )  . 

This  should  serve  as  a  second  response  to  Manes'  complaints  [170] 
(see  also  10.2  A),  since  the  cartesian  products  and  sums  (for  ?  e  f<t) 
form  simultaneous  w-hom's  with  respect  to  members  of  the  families. 
Arithmetic  operations  (such  as  fuzzy  sums,  functional-cartesian 
product  transforms,  fuzzy  products,  etc.)  also  form  w-hom's  for  S& 

and  S 

~or 

For  subset  inclusions,  the  following  holds  for  any 
Aj  .  A2  €  5(X)  : 


"<A1  £  V"  •  «V  Z  A  <x>>.  -  Pr(SS.l|Sl>  5  Sor ,  2  'A2 ' ' 

xeX  1 


The  above  results  generalize  the  single  argument  choice  func¬ 
tions  S,  and  S  where,  for  any  A  e  ?(X)  ,  is  a  doubt 

&  or  a  ,  u 

measure  and  or  is  a  plausibility  measure  obtained  by,  for  all 

C  «  J (X)  , 


by  extending  the  domains  from  3  (X)  to  ?(X)  via  Theorem  3, 
Chapter  4.  Note  also  that 
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S  (A)  =  S  ;  T ( A)  =  S  ;  all  A  e  5 ( X)  . 

u  *A,min  ^A.prod 

Sy  is  not  a  part  of  the  S&  nor  SQr  family,  in  general, 

except  when  U  is  chosen  to  be  identical  to  U  ,  i.e.,  IK  =  U  , 

for  all  j  e  J  ,  whence  =  min  . 

T  is  not  a  part  of  the  S„  nor  S  family,  in  general, 

-or 

unless  <f>&  is  chosen  to  be  prod,  in  which  case,  & 

becomes  a  completely  statistically  independent  (with  respect  to  all 
indices)  zero-one  process. 

A  basic  question  is  to  determine  what  fuzzy  set  systems  exist 
which  lead  to  simultaneous  w-hom's,  for  either  the  Sy  family  or 

the  T  family,  for  cartesian  sums  and  products  and  possibly  com¬ 
plements,  and  for  arbitrary  combinations  of  these  operations.  In 
the  last  case,  it  may  be  interpreted  that  a  generalized  set  system 
involved  in  such  a  relationship  is  identifiable  with  a  correspond¬ 
ing  random  set  system. 


Theorem  1  (Identification  of  Generalized  Set  Systems  with  Random 
Set  Systems.) 

(a)  7  is  semi-distributive  iff  (x  ;  x  ,  S)  and  (t  ;  r,  S)  are 
simultaneously  w-hom's  for  S 

iff  (combo(x  ,  t  )  ,  combo(x,t),  S)  is  a  w-hom, 

where  S  =  Sy  or  S  =  T  ;  in  the  latter  case,  replace  x  by  n 
and  t  by  u  everywhere . 

(b)  5  is  semi-distributive  and  DeMorgan  iff  (a)  holds  with  the 
inclusion  of  C  in  the  combo  operators  and  a  modification  of  cer¬ 
tain  random  set  representations  (those  corresponding  to  an  odd 
number  of  DeMorgan  transposes  to  be  clear  of  all  complements)  where 

in  places  S^(A^)  is  replaced  by  C S ^ ( c  Aj )  . 


(C) 

Assuming  for 

7  that 

■a 

-e- 

<0 

(x,y)/<5y)  Q  exists  for 

all 

X  € 

[0,1]  ,  then 

5  = 

iff 

(b) 

holds  with  no  modifications  of 

random  set  representations. 

(d) 

Consider  the 

case  S  = 

sy 

and 

7  a  DeMorgan  fuzzy 

set  system: 
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(1)  U  Is  a  statistically  independent  process  iff  5  =  5 
whence  (c)  applies; 

(ii)  U  is  an  identical  process,  i.e.,  B .  s  0  ,  iff 
5=5,  whence  (b)  is  applicable  with 

C  Sy(C  Aj)  =  S1_y(Aj)  (with  probability  one) ; 

( iii )  For  U  =  (U.,U0)  only  :  U.  =  1  -  U  iff  ?  =  cf 
However,  although  (x  •  x,  Sy)  is  a  w-hom, 

(t  .-t.Sy)  is  not  simultaneously  a  w-hom. 


(e)  Yager's  DeMorgan  family  n)  yields  for  each 
ponding  ,  generating  uniform  [0,1]  process 


P 

U 

~P 


and  corres- 


(  x  ;  x  ,  s 
P 


is  a  w-hom,  but  (t 

P 


Sy  )  is  not  simultaneously  a  w-hom. 
~P 


Parameter  p  can  be  shown  to  be  directly  proportional  to  the  pair¬ 
wise  correlation  of  random  set  coverage  here  [91]. 

Three  basic  problems  arise  concerning  relations  between  fuzzy 
and  ordinary  set  operations  with  respect  to  weak  homomorphism. 


Basic  Problem  1 ■ 

Given  any  ordinary  n-ary  set  operation  *  ,  what  ^choice  func¬ 
tion  families  S  exist  and  what  fuzzy  set  operations  *  exist  such 

that  (*  ;  * , S )  is  a  w-hom? 


Theorem  2  (Solution  to  Basic  Problem  1  -  See  [81].) 

Given  any  *  and  any  S  ,  there  exists  a  unique  *  such  that 

(*,  *,  S)  is  a  w-hom.  The  relationship  is  given  explicitly  by,  for 
all  x  €  X  ,  A  «  5 (X)  , 


t;  *<*)  -  PrOs,*,  «  -  f  ♦  , 

zez  <?»  °x,  1 


U)  d  us(A)(z)  ; 


X, 


(4>c)  =  ;  Zn  =  x  {0,1}  J  ;  c  €  9  (X)  . 
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In  addition,  *  extends  *  ,  i.e.,  for  all  C  e  »(£)  . 

*  c  =  *c  . 


Corollary  1 


If  * 
g,  :  (0,l)n 


(Basic  Problem  1  for  Compositional  Operations.) 

is  a  composition,  i.e.,  there  is  a  fixed  function 
(0,1)  ,  with 


<M4>C)  =  g,°  (1>c)  :  all  C  e  ?(X)  , 


then 


(as  [  0 


,7} 


(g* (a) • Pr(x€ (  n 

a  =1  J 


S,(A  )  H  (  u  S  (A  ) ) ) )  ;  x  e 

J  a^=0  J  J 


Ordinary  binary  composition  set  operations. 

2 

Let  g„  :  (0,1)  -*  (0,1)  be  arbitrary.  There  are  16  possi¬ 

ble  gt's  ,  analogous  to  the  16  possible  classical  logical  binary 
truth  operators.  Let  X1  =  X2  =  X  be  arbitrary  spaces.  Then,  each 
g,  corresponds  uniquely  to  a 

:  Z2  =  {0,1}X  X  { 0 , 1  }X  -  (0,1}X  and  *  :  X  X  X  -  X  such  that 

for  any  C^  e  »(X)  ,  j  =  1,2, 

$(*(  c1,c2))  =  <?>(c1*c2)  =  <M*Cj*4>c  )  =  g,°  (<t>c  .*c  )  • 

Now  let  S j  :  ?(X)  -•  a(X)  be  arbitrary  such  that  for  any 
A j  €  7(X) 

sj{ V  •  Aj  :  j  =  i-2  •  (• ) 

It  follows  readily  that  the  joint  random  sets 
( s 1 ( A 2 ) ,  S2(A2) )  may  be  arbitrary  such  that  for  all  x  e  X  , 

f  (  A  j  ,  A2  ;  x  )  =  Pr  (  x  €  S1(A1)  n  S2  (  A2  )  )  =  Pr  { x  €  S^A^,  x  €  S2  (  )  ) 

=  Pr(<p  (x)  =  1  ,<?>  (x)  =  1 )  J 

A1  A2 

L  (  A  ^  ,  A  2  ;  x  )  S  f(A1,A2;X)  5  U  (Aj.Aj.-x)  , 

U ( Aj , Aj ; x )  =  min  Pr(x  e  S1(A1),  Pr(x  e  S2(A  ))) 

=  min  (<p.  (x),  <$>  (x))  ,  the  maximal  t-norm 

A1  A2 
evaluation. 
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L(ArA2;x)  =  max  (Pr(x  e  S^A^)  +  Pr(x  e  S2  ( A2 )  >  -  1,0} 

=  minsum  (<}>  (x)  ,  (x)  )  -  the  minimal  t-oonortn 

A1  h2 
evaluation , 

where,  in  general,  there  does  not  necessarily  exist 
g  :  {  0 , 1 }  2  -»  {0,1}  ,  such  that 

Pr(tpS1(A1)(x)  =  1  '  ^S2(A2){X>  “  X)  =  3(1>Ai(x},  ^(x)) 

for  all  x  e  X  ,  j  =  1,2.  (Sklar's  Theorem  [226]  is  too  weak  for 
use  here.  See  also  the  more  general  bounds  L,U  discussed  in 
section  2.3.9  ( A) 

Next,  for  each  given  choice  function  family  S2  =  (s1>s2) 

(with  S j , S 2  arbitrary  satisfying  eq.  (•)  and  for  any  A ^  e  5(X)  , 

j  =  1,2,  (S1(A1),  S2 ( A2 ) }  being  a  well-defined  joint  random  set), 

by  either  direct  expansions  of  each  of  the  16  operators  *  or  by 
specializing  Corollary  1  to  n  =  2  ,  it  follows  that  16  unique  bi¬ 
nary  generalized  set  operations  *  may  be  defined  such  that 
(*  ,  * ,  S  2 )  is  a  w-hom,  i.e.,  for  all  A.,  e  ?_.(X^.)  ,  all  x  e  X  , 
j  =  1,2  , 

Pr(x  e  (SjtAj)  *  S2(A2)))  =  tAi*A  (x>  : 
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g,  defined  by 

crc2 

Va  ,lxl 

its 

evaluation  over: 

for  any 

for  any  ,  A2 

e  ?(X)  ;  x  e  X 

Operator 
No . 

(l.D 

(1.0) 

(0,1) 

(0,0) 

C  .  c. 

1  2 

€  3><X) 

1 

i 

1 

1 

1 

X 

1 

2 

i 

1 

1 

0 

C1  U  C2 

*?»  <x>  +  <x>  -  f 

A]  a2 

3 

i 

1 

0 

1 

C2  U  (X  -1 

C2> 

i  -  4>.  (x)  ♦  f 

A2 

4 

i 

1 

0 

0 

C1 

♦a  (x) 

A1 

5 

i 

0 

1 

1 

c2  U  (X  -1 

Cl> 

1  -  <t>.  (x)  +  f 

A1 

6 

i 

0 

1 

0 

C2 

1>A  (X) 

A2 

7 

i 

0 

0 

1 

X  -1  (Cj  ^ 

V 

1  -  «t>A ( X )  -  <fA  (X)  + 

8 

i 

0 

0 

0 

Cin  c2 

f 

9 

0 

1 

1 

1 

X  H  (CjO 

V 

1  -  f 

10 

0 

1 

1 

0 

C1  *  C2 

(x)  +  <t>  (X)  -  2f 

1  A2 

11 

0 

1 

0 

1 

XH  C2 

1  -  <}>  (x) 

A2 

12 

0 

1 

0 

0 

C1  -1  C2 

<J>  (x)  -  f 

1 

13 

0 

0 

1 

1 

X  H  C 

1  "  <P,  (x) 

1 

14 

0 

0 

1 

0 

C2  H  C1 

<P.  (X)  -  f 

A2 

15 

0 

0 

0 

1 

x  -i  (c  u 

C2) 

1  -  <p.  (X)  -  If  (x)  + 
A1  A2 

16 

0 

0 

0 

0 

2 

0 

Table  I. 

Generalized  Set  Operator  Extensions  Isomorphic-Like 
(w-hom)  to  Ordinary  Binary  Composition  Set  Operators. 

Thus,  for  example,  consider  operator  2,  union.  By  first 
choosing 

f  =  min  (<f>  (  )  ,  <p  (  )  )  , 

1  A2 
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occurring,  for  example,  for  S.(A.)  a  S  (A.)  -  A  [U,l]  ,  U  a 

3  3  u  j  A  j 

uniform  [0,1]  random  variable,  then  for  all  x  e  X  ,  Aj  e  ?(X)  , 

j  =  1,2  . 

$A  u  A  (X)  =  maX  (<fA  (X)'  V,(x)> 


=  Pr ( x  €  SyfAj)  u  Sn'A2))  . 


Or  by  choosing 


t  «  <t>A  (-)  • 

A1  A2 

occurring,  for  example,  for  S.(A.)  =  S.,  (A.)  =  <p  1[U,1]  , 

33  3  Aj 

j  =  1,2  ,  where  U  and  U2  are  statistically  independent  uniform 

[0,1]  random  variables, 

<t>Al  13  A2<X)  =  <?Ai(X>  +  <t’A2{X)  ~  '^A1  (x)  '  *A2  (x)  ’ 


=  probsum(<]>  (x),  $  (x)  ) 

A1  2 


Or  by  choosing 


f  =  max  (<p .  («)  +  $A  (•)  -  1,0)  , 

1  2 

occurring,  for  example,  for  S^.(Aj)  =  (A..)  =  $A  » 

j  =  1,2,  where  =  1  -  U2  are  uniform  [0,1]  random  variables, 

i  A  ( x )  =  min  (t  (x)  +  <}>  (x),  1) 

A 1  2  A1  2 

=  minsum  ( <f>  (x),  4>A  (x))  . 

A 1  2 

Note  also  the  table  includes  the  four  unary  composition  exten¬ 
sion  forms: 


g,  d  ’fined  by  *(C)  <)>  ;  ,..(x)  < 

Operation  its  evaluation  over:  for  any 

No.  1  I  0  C  e  »(X)  for  any  A  e  SF(X);  x  €  X 


VX) 

1  -  *A(x) 


Table  II. 


Generalized  Set  Operator  Extensions 
Isomorphic-Like  to  Unary  Compositions. 
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Basic  Problem  2. 

Given  an  n-a^y  generalized  set  operation  *  ,  does  there  exist 

a  choice  function  family  S  and  an  ordinary  n-ary  set  operation  * 

such  that  (*  ;  *,  S)  is  a  w-hom? 

First,  it  can  be  shown  that  there  is  at  most  one  *  such  that 

(*;  *,  S)  is  a  w-hom  (for  any  possible  S):  namely,  *  restricted 

to  ordinary  subsets.  ( Proof :  Restrict  the  definition  of  a  w-hom 

to  ordinary  subsets  and  use  the  fact  that  the  one-element  equiva¬ 
lence  classes  under  »  are  generated  by  the  class  of  all  ordinary 
subsets . ) 

Secondly,  the  aforementioned  examples  of  choice  function  fami¬ 
lies  S„  ,  T  ,  Sr  ,  S  show  that  many  such  w-hom 1 s  exist.  In 

U  ~  ~or 

addition,  it  can  also  be  shown  (Goodman  [91],  Cor.  6.2)  that  fuzzy 
partitionings,  and  in  particular,  all  fuzzy  weighted  sum  operators 
(not  t-norms  nor  t-conorms,  in  general),  have  natural  corresponding 
weak  homomorphic  random  set  representations.  These  are  in  the  form 
of  disjoint  unions  of  intersections  of  corresponding  pairs  of  sta¬ 
tistically  independent  weak  homomorphic  T  -  type  random  sets. 
Nevertheless,  there  do  exist  classes  of  fuzzy  set  operations  which 
have  no  random  set  counterparts  which  preserve  the  number  of  argu¬ 
ments  of  the  operator.  (See  *.lso  Tteofvm  4  ,  ^ereO 


So«e  binary  fuzzy  set  operations  with  no  binary  random  operation 

represents  ions. 

Let  be  the  open  unit  interval,  D2  be.  the  open  unit  square, 

and  g2  be  such  that  for  all  x,y  e  D2  , 

g2  :  °2  -  ;  g2(x,y)  =  x  . 

Then  let  g  be  arbitrary  such  that  g  :  [0,1]  x  [0,1]  -»  [0,1]  , 

g  *  g2  (over  D2)  ,  g(0.0)  =  g(0,l)  =  0  ;  g(l,0)  =  g(l.l)  =  1  . 

In  turn,  define  the  binary  fuzzy  set  operator  (*)  over  s(X)  by 


<K  (*)  A  (x)  ?  9 ( a  :  A  A  e  7(X)  ,  x  €  X  . 

Al(  'gA2  A1  A2  1  i 


Then  ( ( * ) 

g 

binary  operator 
selected . 


,  S)  is  never  a  w-hom,  no  matter  what  ordinary 
and  choice  function  family  S  =  are 
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A  related  construction  yields  an  analagous  lack  of  possible 
unary  set  operator  representations  for  a  class  of  unary  fuzzy  set 
operators:  Choose  g  :  [0,1]  -*  [0,1]  to  be  arbitrary  such  that 
g  £  identity,  1-identity , 0 , 1  .  In  particular,  this  implies  if  7 

is  Archimedean,  the  qth  intensification  operator,  which  linguis¬ 
tically  can  correspond  to  ”very(-}",  ’’very,  very)  ■  )  ”,  etc., 
depending  on  the  integer  q  chosen,  where  for  all  x  e  X  and  all 
A  €  9 (X)  , 


v*^ 


-(X)  = 


Wx: 


$A ( X )  )  ,  ■ 


..4>a(x>) 


factors ) 


has  no  random  set  unary  operator  counterpart,  although  by  Theorem  2, 

for  many  such  fuzzy  set  systems,  (•  )^q^  does  hav~  a  q-ary  random 
set  representation  with  A1  =  A2  =  • •  =  A^  =  A  for  the  w-hom 

(  (  )q  ;  n,S)  ,  i.e.  ,  for  all  A  €  <f(X)  , 


Aq  =  s(Aq  )  S  S ( A  n  An  • • n  a )  a  S1(A)n  s2 (A)n- -ns  (A)  . 


Basic  Problen  3. 

Are  there  weak  homomorphic  relations  between  given  generalized 
set  operators  which  do  not  depend  on  specific  choice  function  sub¬ 
families  for  some  large  class  of  choice  function  families? 
Equivalently,  how  closely  do  weak  homomorphic  generalized  and 
ordinary  set  operations  resemble  true  algebraic  equivalence  class 
operations,  where  the  operations  do  not  depend  on  the  particular 
choice  of  equivalence  class  representative? 

A  partial  affirmation  to  Problem  3  is  given  in 


Then  define  n-ary  generalized  set  operator  (_*j^  over  3(X)  ,  by,  for 

all  A  «  5 (X)  ,  x  e  X  , 


IsoMorphic-Like  Relations 
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%U(X>  ”  Z-1  I  1  1  rk  i^A  (x 

'  k«f  (1)1  j«I  °  j 


0)  • 


If  n-ary  composition  operator  over  ?(X)  is  defined  by,  for 

all  C  e  3>(X)  ,  x  e  X  , 

♦  ;  ,r.(x)  =  f  ( (x))  . (x))  , 

f(S)  C1  n 

then  it  follows  that  for  any  statistically  independent  choice  func¬ 
tion  family  S  =  (S.)  .€l  (i.e.,  for  any  A  e  5(X)  ,  the  S.(A.)’S' 

J  J  J  J 

are  all  statistically  independent),  (*^  ;  ,  S)  is  a  w-hom. 

In  [91]  (Theorem  6.5  and  Corollary  6.2),  isomorphic-like  re¬ 
lations  were  established  between  fuzzy  partitionings  -  and  in 
particular,  between  weighted  sums  of  membership  functions  and  random 
set  partitionings! 


Theorem  4.  (Weak-Homomorphic  Representations  for  Fuzzy 
Partitionings. ) 


Let  K  =  (l....,m)  and  let  B  =  (  B  . )  .  be  a  fuzzy  par t  i  t  i  on- 

~  J 


in g  of  X  ,  i.e.. 


t  (x)  =  1  ,  for  all  x  e  X  . 

j«K  Bj 

Then  T^^(B)  =  (T.^^(B))  .  v  is  a  random  partitioning  of  X 
~  3  J€  K 


1  =  Pr(X  =  U  T  '  1  (B  .) )  , 


with  each  T 


being  a  T-type  choice  function; 


T(1)(B)  *  B  ;  T  (1)(B)  «  B',g  e  K 


may  be  constructed  as  follows:  Let 

V ( B )  =  {V  { B ) )  be  a  zero-one  process  where, 

~  ~  j  ,  x  —  3€  K 


Vx'®1  *  Vx'V  • 


V*1®’  ■  11  -  “i.x'V  •|1  -  Vi.*lVi,l'":i.x(V 

for  j  =  2 . m-1  , 


V  (B)  =  (1  -  W  ( B .))••■( 1  -  W  ( B  ) )  ,  where 

m ,  x  ~  l  ,  x  i  m ,  x  m 
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<Wj,x<V> 


is  a  statistically  independent  zero-one  process 


with  Pr  ( W  ( B  j )  =1)  =  <pn  (x)  /  Z  (x)  ;  j  e  K,  x  e  X  .  Then 

m,x  3  B.  i=. 

define  .  .  (x)  -  V.  (B),  j  e  K,  x  e  X  :  thus 

T  .  1  ( B )  3'X  ~ 

J 

$  .  =  V  ( B )  ,  and  T(1){B)  =  1  ( V  ( B )  )  ,  <p  being  the  1-1 

T(  '  (B) 

V 

onto  membership  function  mapping  $  :  »(X)  -♦  {0,1} 

Corollary  (Weak-Homomorphic  Representation  for  Weighted  Sum 
Operator . ) 

For  any  constant  t ,0  <  r  i  1  ,  define 
9  e  5(X)  by  A  (x)  a  t  ,  for  all  x  e  X  .  Let  K  ={l,...,m)  and 

T  a 

T 

let  0  <  a  <  l  be  arbitrary  constant,  j  e  K  ,  with  r  A  .  =  l  . 

3  jeK  3 

Let  9  =  (9  )  .  .  Define  the  extended  weighted  sum  operator 

A  Aj 

wtdsum  ^  :  s(X)K  -♦  s(X)  ,  by,  for  any 
A  ^  (A  )  «  Aj  €  *(x).  j  «  K,  x  €  X  , 

♦wtd.u.,14)""  i  V'"' 

where  n  is  determined  by  <f>&  =  prod.  Then 

T(1)  ^  (Tj(1,)j€K  3nd  ?{2)  =  {Tj<2))jeK  are  tWO  statis~ 
tically  independent  T-type  choice  function  families  such  that 
T ^  1  ^ ( 0 ^  )  is  a  random  partitioning  of  X  and 

wtdsum. (A)  *  u  (T.(1)(8l  )  n  T.(2)(A.))  . 

~  ~  j«K  3  Aj  3  3 

T ^ 1 3  ( ® )  is  the  same  as  in  Theorem  4  with  B  replaced  by  e 


T(2)(A)  =  (T  (2)(A  ))  is  such  that  (<$>  (x))..,.  is  a 

3  3  3€K  T .  2  ( A . )  3e^ 

3  3  xeX 

statistically  independent  zero-one  process  which  is  also  indepen¬ 
dent  of  such  that 
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Pr($  .  .  (x)  =  1)  =  <p.  (x)  ,  for  j  €  K  ,  x  €  X  . 

V  <v  j 

The  proofs  of  Theorem  3,  Theorem  4  and  its  corollary  can  be 
found  in  Goodman  [91]. 

When  there  is  no  confusion,  unless  otherwise  indicated,  we  will 
use  the  same  notation  for  ordinary  set  operations  and  their  general¬ 
ized  set  extensions. 


Summary 

Thus,  in  summary,  any  generalized  subset  A  of  X  may  be  con¬ 
sidered  as  a  weakened  form  of  a  random  set  S  ;  A  corresponds  to 
the  collection  of  all  random  sets  one-point  coverage  equivalent  to 
it.  At  the  same  time,  the  corresponding  possibility  function  of  A 
may  be  considered  as  the  evaluation  function  of  a  random  variable, 
over  an  elementary  event  space,  for  corresponding  compound  events 
which  may  be  identified  with  the  elements  of  X  .  By  choosing  par¬ 
ticular  random  set  representations  for  generalized  sets,  isomorphic¬ 
like  (or  "weak-homomorphic")  relations  may  be  established  between 
generalized  set  and  ordinary  set  operations,  etc. 

One  application  of  the  above  can  be  made  to  the  use  of  a  panel 
of  experts  in  obtaining  distributional  information  concerning  a  col¬ 
lection  of  objects  (such  as  possible  classifications  of  a  ship)  when 
the  values  of  the  "distribution"  do  not  add  up  to  unity  -  often  the 
sum  will  exceed  one).  It  does  not  mean  the  laws  of  probability  are 
being  violated  nor  should  normalization  be  carried  out  to  make  the 
"distribution"  formally  a  legitimate  one.  Indeed,  the  distribution 
represents  probability  values  of  properly  chosen  compound  overlap¬ 
ping  events  which  the  panel  as  human  integrators  of  information  in 
effect  tacitly  employ  -  or  equivalently  the  distribution  represents 
coverage  probabilities  of  a  suitably  implicitly  chosen  random  set  of 
interactions.  (See  Chapter  9  for  more  details.) 

Finally,  note  that  the  field  of  random  set  theory  is  not  as 
well  known  as  other  branches  of  probability  theory.  For  background, 
see  the  general  works  of  Kendall  [137],  Matheron  [171]  and  Ripley 
[217].  See  also  Cressie  [43],  Ripley  [217]  and  Artstein  and  Vitale 
[7]  for  results  concerning  random  set  versions  of  the  Laws  of  Large 
Numbers  and  the  Central  Limit  Theorem. 
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CHAPTER  7 


SOME  THEORIES  OF  UNCERTAINTY 


In  this  chapter,  we  discuss  different  approaches  to  the  problem 
of  uncertainty  modeling.  These  include  Manes'  distributional 
theory,  Watanabe's  system,  Gaines'  uncertainty  logic  and  Schefe's 
agreement  probabilities. 


7.1  Introduction 

Generalized  set  theory  in  its  general  formulation,  as  mentioned 
previously,  already  is  an  extension  of  probability  theory  over  dis¬ 
crete  spaces.  (Thus  there  may  be  no  need  to  construct  systems  which 
formally  generalize  both  fuzzy  set  theory  and  probability  theory) . 
This  is  true  by  simply  recognizing  that: 

(1)  All  probability  functions  p  (over  a  discrete  space)  are 
also  possibility  functions  (but  of  course,  not  conversely). 

(Indeed,  all  cumulative  and  anti-cumulative  distribution  functions 
are  also  possibility  functions.) 

(2)  The  usual  probability  operations  may  be  interpreted 
through  the  particular  fuzzy  set  system 

* 

9'  =  ( 1- ( • ) ,prod , sum)  , 

noting  the  operation  sum  is  the  same  as  bndsum  (or  maxsum)  over  the 
cone  of  all  probability  functions. 

(3)  Complete  analogues  (as  mentioned  previously)  may  be  es¬ 
tablished  between  the  concepts  of  joint  conditional,  marginal  and 
(via  Bayes'  theorem)  posterior  possibility  and  probability  func¬ 
tions.  Hence,  all  possibilistic  forms  will  reduce  to  ordinary 
probabilistic  counterparts  when  9*  Is  employed  with  respect  to 
probability  functions  only.  This  may  be  summarized  by  the  many-to- 
one  correspondences  (over  spaces) : 


p-possibility  function 


rnot 


♦i 


ror 


p-probability  function 
1  -  (•  ) 

prod 

sum 


(4)  It  should  also  be  mentioned  that  an  obvious  interpretation 
of  any  probability  measure  Pr  over  a  o-algebra  a  of  subsets  of 
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its  base  space  X  :  Pr  :  a  -»  [0,1]  is  a  fuzzy  set  membership 

function  representing  a  fuzzy  subset  of  a  . 

(5)  On  the  other  hand,  note  that  probability  theory  is  not  a 
truth  functional  system  as  Zadeh's  fuzzy  set  theory  is.  (See  the 
remarks  in  7.4  for  Gaines'  system.  See  also  section  2.3.9  (A).) 

Figure  A  presents  some  relations  between  various  logical 
systems . 

In  the  wake  of  the  criticisms  of  Zadeh's  fuzzy  sets,  several 
systems  have  been  developed  which  simultaneously  extend  both  Zadeh's 
original  formulation  of  fuzzy  set  theory  and  classical  probabilistic 
concepts.  Among  these  are  Hirota's  "probabilistic  sets"  [111]  - 

essentially,  randomized  fuzzy  sets  (perhaps  more  accurately,  ran¬ 
domized  fuzzy  set  membership  functions),  and  the  large  literature 
devoted  to  fuzzifying  probability  spaces  and  random  variables.  (See 
the  survey  in  [51],  as  well  as  Kwakernaak  [148]  and  Klement  and 
Schwyhla  [142].)  Manes  [169],  based  upon  his  previous  work  with 
Arbib  [5]  in  developing  category  theory  models  for  fuzzy  machines, 
used  a  form  of  the  Kleisli  category  theorem  [165]  in  deriving  a  very 
general  theory  of  distributions  which  extend  not  only  Zadeh's  (min- 
max,  etc.)  fuzzy  set  concepts  and  classical  probability  relations, 
but  also  generalizes  the  basic  structure  of  topological  neighborhood 
theory,  credibility  theorv  and  other  approaches  to  the  modeling  of 
uncertainties.  (However  -  see  Goodman  and  Nguyen  [92]  for  a 
characterization  of  Manes'  system  which  shows  that  although  it 
generalizes  Zadeh's  original  concept  of  fuzzy  set  theory,  it  is  not 
compatible  with  more  general  fuzzy  set  systems  ($not'  ^or^  ' 

See  also  the  following  section.) 
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Figure  A.  Processing  flow  of  Information  and  relationships 
between  some  selected  logical  systems. 
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7.2  Manes 1  distributional  theory 

A.  Summary  of  Manes'  basic  work. 

Manes  [169]  developed  a  theory  of  "fuzzy"  theories  as  an  out¬ 
growth  of  his  earlier  work  [5]. 

1 .  Consider  the  category  SET. 

(a)  Given  a  monad  (7,e,p)  over  SET,  i.e.,  7  :  SET  -*  SET  is  a 

functor  and  e  :  Id  — s’  and  p  :  Toy  y  are  natural  trans¬ 

forms  satisfying  certain  commutative  relations  (see,  e.g.,  S. 
MacLane  [165]),  then  by  defining  #  ,  where  for  any  sets  X,Y  and 
a  €  Ar  (X,!T  (Y)  )  , 

#(«)  =  p(y)oy(a)  e  Ar(* (X) ,? (Y) )  , 

then  Kg.  is  a  category  -  the  Kleisli  category  associated  with  7 
(again,  see  [169])  where 

Ob ( Kg. )  =  (9-(X)  |  X  any  set) 

Ar(Kg)  =  (#(<x)  |  a  «  Ar(X,sr(Y>)  ,  X,Y  any  sets) 


with  composition  «>  defined  by 

#(p)o#(a)  =  #(#poa)  ; 

and  with  identity  maps 

#(e(X))  *  idy(x)  ; 

and  finally,  with  the  relation 
#(a)oe(X)  =  a  ; 


(M3) 


(M2) 


(Ml) 


holding  for  all  a  «  Ar(X,5T(Y))  ,  all  p  e  Ar(Y,:r(Z))  ,  and  all  sets 
X  ,  Y  ,  Z  .  (Ml)  and  (M2)  imply  e(X)  is  uniquely  determined. 

(b)  Conversely,  given  a  mapping  7  :  Ob (SET)  -♦  Ob (SET)  a  mapping 
e  :  0b( SET)  -*  Ar(SET)  ,  where  for  any  set  X  ,  e(X)  e  Ar(X,3"(X))  , 
and  mapping 


u  Ar(X,!T  (Y) ) 
all  sets 
X ,  Y 


u  Ar (3"  (X)  ,7  ( Y )  )  satisfying  (Ml)  , 

all  sets 
X,  Y 


(M2)  ,  (M3)  .  Then  by  defining  p  :  Ob(SET)  -  Ar ( SET )  by,  for  any 

set  X  , 


M(X)  =  #(idg(x))  , 


and  defining  for  any  f  «  Ar(X,Y)  , 
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*<f)  =  #(e(Y)«f)  «  Ar(s"  (X)  ,sr  (Y) )  , 

and  composition  via  (M3)  ,  it  follows  that  3'  is  a  functor, 

e  :  id  — s'  and  p  :  s'os'  — ♦  3"  are  natural  transforms  and  (s'.e./j) 
is  a  monad,  etc. 

Manes  calls  (s'.e,#)  a  theory ,  when  case  (b)  holds  (which  is 
thus  equivalent  to  specifying  monad  [7,e,p)  and  case  (a)).  s'  is 
the  distribution  map,  e  is  the  Imbedding  map  and  #  is  the  domain 
extension  or  lifting  operator  .  This  is  motivated  by  the  special 
cases  presented  next . 


may  be  made,  then  (M3)  implies  that  composition  o  is  the  same  as 
fuzzy  relational  product 
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<$>  (x,z)  =  sup  min 

8(A)  ysy 

for  all  x  e  X  ,  z  s  Z  ,  which 

(ii)  For  any  set  X  : 


(^(x.y),  4>b(Y.z)) 

is  an  associative  operation. 


sr(X)  =  (p  |  p  :  X  -♦  [0,1]  &  p  is  a  finite  (supported)  pro¬ 
bability  function  over  X)  , 

e(X)(x)  ^  <f>.  .  (relative  to  X)  ;  all  x  e  X 
\  / 

(#(a)(p)(y)  =  z  (p(y)  •  a  (x)  (y)  )  ,  for  any  a  :  X  7  (Y)  ,  all 

xeX 


y  e  Y  . 

In  this  case,  for  any  f  :  X  -»  Y  ,  for  any  sets  X  and  Y  , 
7(t)  :  sr  (X)  -*  s’ (Y)  ,  where  for  any  p  «  7  (X)  and  any  y  e  Y  , 


y (f ) (P) (Y)  =  ^  p(x)  , 

x«f-1 (Y) 

i.e.,  7(f)  (p)  is  the  probability  function  of  r.v.  f(V)  where 

r.v.  V  has  prob.  function  p  . 

(iii)  For  any  Set  X  : 

y(X)  =  y(X)  ,  the  class  of  all  ordinary  subsets  of  X  , 

e(X) (x)  =  (x)  ;  all  x  «  X  , 

(#(a) ) (C)  =  U  a(x)  =  a(C)  , 
x«C 

for  any  a  :  X  -*  »{Y)  ,  and  any  C  «  »(X)  . 

Manes  calls  this  case  "possibilistic"  set  theory. 

Manes  also  specializes  theory  (y,e,#)  to  a  number  of  other 
systems  including  "crisp"  set  theory  with  y(X)  =  X  ,  priority 
theory,  credibility  theory  and  topological  neighborhood  theory. 


3 .  Additional  co—ents. 

Thus  Manes  cleverly  shows  that  the  concept  of  a  Kleisli  cate¬ 
gory  associated  with  a  monad  over  SET  is  sufficient  to  describe  not 
only  Zadeh's  fuzzy  set  theory  and  classical  discrete  probability 
theory  but  a  number  of  other  theories  of  uncertainties.  He  obtains 
a  number  of  basic  properties  common  to  all  of  these  theories.  In 
summary,  this  includes: 

(a)  Development  of  maps  between  theories  as  essentially  "natural" 
(i.e.,  commutative-form)  mappings  between  the  three  corresponding 
components  of  theories,  which,  following  category  theory  concepts, 
is  specialized  to  quotient  or  surjective  types,  subobject  or  injec- 


Sow  Theories  of  Uncertainty 


431 


tive  types,  and,  further,  to  isomorphisms  (which  are  characterized 
as  both  surjective  and  injective  types).  Also,  cartesian  products 
of  theories  are  defined  in  the  obvious  way.  Examples  include:  for 
any  fixed  A  «  [0,1]  ,  consider  the  level  set  mapping  TA  ,  where 

for  any  set  X  and  any  A  «  9(X)  ,  TA  (A)  =  <|>~^  [ A ,  1  ]  yields 

* 

( TA , e  »#A)  as  a  quotient  theory  of  Zadeh's  fuzzy  set  theory,  where 

e*(X)(x)  =  (x)  ,  all  x  «  X  ,  and  for  any  a  :  X  -»  T(Y)  , 

TA(a)  :  X  -♦  9(X)  ,  where  for  all  x  e  X  ,  T A (a )  (x)  5  TA(a(x))  ,  and 

for  any  A  «  9{X)  ,  and  thus  ^^[A.l]  «  9(X)  , 

^A  (  ^  ’  1  ]  )  =  (^(«)  )  (<})A)  )-1CA  .13  . 

On  the  other  hand,  (9,e,#)  where 

3"(X)  =  (A  |  A  «  9{X)  &  sup  $  (X)  =1)  ,  for  any  set  X  is  a  sub- 

X€X 

theory  of  Zadeh's  fuzzy  set  theory.  ( 9"  ( X )  is  the  class  of  norm- 
able  membership  functions.) 

(b)  Development  of  equality  and  membership  relations: 

Define  for  any  theory  9  =  (9,e,#)  ,  9({0,1})  as  the  theory 

truth  values.  Define  the  general  membership  function  <{>  ,  for  any 
set  X  ,  <p(X)  :  9(X)  x  «r(X)  9({0,1})  ,  by,  for  any 

C  e  9{X)  ,  A  «  9(X)  . 

<|>(X)  (C,A)  =  <t»(X)  A(C)  3  9-  <«J>C)  ( A)  , 

<|>c  :  X  -♦  {0,1}  ordinary  membership  function  of  C  . 

Define  the  equality  map  eq  ,  where  for  any  set  X  , 
eq(X)  :  9"  ( X )  x  9"  ( X )  -♦  9  {{0,1})  where  first  for  any  x  e  x  and 
A  €  sr  (X)  , 

eq(X)(e(X)(x),A)  =  9 (<f> {x) ( A) }  , 

and  for  any  A,B  «  9(X) 

eq(X)  (B,A)  =  {#(9(<t.{.  }(A)))){B)  . 

Alternatively,  Manes  could  have  developed  a  one-branch  truth 
theory  based  on  9  (  { 1 } )  instead  of  9"  ({0,1})  ;  evaluations  are 

similar  for  only  this  one  component.  (The  idea  of  using  9(0,1)  is 
based  on  Gaines'  earlier  approach.)  Or,  one  could  choose  any  number 
of  values  0  *  aQ  <  • •  <  an  *  1  and  use  9 ( {a^, . . . ,a  } ) . 

For  Zadeh's  fuzzy  set  theory 

9-  (4>c)  (A)  *  <t>c(A)  ;  all  A  e  9  { X )  ,  C  e  9(X) 

where  $C(A)  e  9(0,1}  , 
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**C(A) 


(0) 


SUP  ( <t>  ( X)  )  =  SUp  $A(X) 

*«<fc  (0)  x«C 


(A){1)  =  sup-1  <‘MX>)  =  SUP  ^A(X) 
*C(A)  X€  <|>c  1  (  1 )  A  X€  C 


and  for  any  A,B  «  5{X)  ,  eq(X)(B,A)  e  5(0,1}  , 


*eq(X) (B,A) (0) 
^eqfX) (B,A) (1) 


sup  min  ($A(x),  tg(Y)) 

x ,  ye  X 

x*y 

sup  min  ($A(x)  ,  <t>B(x)  ) 
X«X 


For  discrete  probability  theory,  *  (<PC)  (p)  is  the  probabil¬ 
ity  function  of  r.v.  <|>C(V)  where  V  is  a  r.v.  corresponding  to 
probability  function  p  e  5(X)  ,  i.e.. 


<PC(V)  =  1 

iff  V  €  C 

4>C(V)  -  o 

iff  v  e  c 

and  hence 


9-(<M  (P)  (0)  =  Pr (V  e  C)  =  Z  p(x)  «  1  -  z  p(x) 
"  X«C  x«C 


and 


*(9r)(P)(l)  =  Pr(V  €  C)  =  z  p(x)  . 
C  xeC 


For  any  p,q  «  5 (X)  , 


eq ( X) ( p , q ) ( 0 )  =  z  p(x)q(y)  =  1  -  z  p(x)q(x) 
x ,  ye  X  xe  X 

x*y 

eq(x) (p,q) (1)  =  z  p(x)q(x) 
x«X 


(c)  Manes  defines 


I.  eq(X)(p,p)  as  degree  of  vagueness  of  p  e  *r(X)  , 


II.  A  theory  7  =  (5,e,#)  is  ant  t  -ref  1  ex  ive  iff  for  all 

sets  X  and  A  «  5(X)  ,  if  eq(X)(A,A)  s  1  ,  then 
A  =  e(X)({x})  for  some  x  «  X  ,  i.e.,  A  is  "crisp". 
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III.  A  theory  £  has  symmetric  equality  iff  for  all  sets 
X  ,  and  all  A,B  e  7 (X)  , 

eq (X) ( A , B )  =  eq(X)(B,A)  , 

IV.  A  theory  £  satisfies  the  eigenstate  conn,  cion  iff 

for  all  sets  X  and  all  x  «  X  and  A  e  ir(X)  ,  if 
$(X)({x},A)  s  l  ,  then  A  is  crisp  where  A  =  e(X) (x)  . 

V.  Fcr  any  set  X  , 

Prcp(X)  =  (9-{0,l})X  , 

is  defined  as  the  class  of  all  propositions  over  X  .  A 
theory  £  is  faithful  iff  for  all  sets  X  ,  the  mapping 

P(X)  :  7  (X)  -  Prop(X)  , 
where  for  any  A  «  y{X)  ,  and  x  «  X 

(P  (X)  (A)  )  (x)  2  <t>(X)({x),A)  , 

is  injective. 

VI.  A  theory  £  is  propos i 1 1 ona I  I y  complete  (or  separating) 

iff  for  all  sets  X  and  any  A,B  e  sr(X)  ,  A  *  B  ,  there 
is  a  «  Prop { X)  such  that 

(#{<*))(A)  *  (#{a))(B)  . 

VII.  A  theory  £  is  consistent  iff  7  is  injective  relative 

to  Ar  ( SET )  or  equivalently,  iff  for  each  set  X  , 
e (X)  :  X  -  y(X)  is  injective. 

VIII.  A  theory  £  is  noise-free  iff  7.01  =  ^ 

IX.  A  theory  £  has  crisp  points  iff  fc;  sr.y  set  X  , 
x  «  X  ,  7 ( {x} )  =  {x}  . 

A  number  of  the  above  definitions  have  direct  category  theory 
counterparts  (for  example,  "faithful"  in  V). 

It  follows  that:  any  faithful  theory  is  propositionally  com¬ 

plete  and  if  f  e  Ar(X,Y)  is  injective  (bijective)  so  is  7(f) 
(bljective).  Any  consistent  theory  with  crisp  points  is  noise-free 
and  a  theor,  has  crisp  points  iff  for  all  sets  X  ,  and  A  €  7 (X)  , 

$(X)(X,A)  3  1  •  Any  theory  has  a  largest  subtheory  with  crisp 

points.  Zadeh's  fuzzy  set  theory,  finite  probability  theory  and 
possibilistic  set  theory  satisfy  conditions  II  -  VIII.  Probabilistic 
set  theory  has  crisp  points  but  Zadeh's  fuzzy  set  theory  doesn’t, 
nor  does  possibilistic  set  theory. 

(d)  XQ-ary  operations  and  homomorphisms  in  Manes'  sense: 


434 


Goodman  and  Nguyen 


and 


any 


Fix  theory  £  =  (ff,e,#)  . 

Fix  set  XQ  .  Then  there  is  a  bijection 

Q(XQ)  ,  where  for  any  A  e  *(XQ)  ,  ;px  (A) 

o 

X. 

set  Y  ,  pv  (A)  ( Y)  :  s'  ( Y )  -♦  7(Y)  where  for 

A 


pv  between  7(X  ) 

A  O 

o 

e  Q(X  )  .  where  for 
any  a  :  Xq  -•  7  (Y)  , 


(A) ( Y) ) (a )  3 


(#(a))(A)  . 


Q(Xq)  is  called  the  class  of  all  (abstract)  XQ-ary 
gr  -opera  t I  ons  . 


For  any  sets  X  ,  Y  and  f  :  7(X)  -♦  ^(Y)  ,  f 

w.r.t.  X  and  Y  and  9"  ,  iff  for  any  set  X  , 

o 

and  hence  any  p  (A)  e  Q(X  )  , 

A  O 


is  a  homomorphism 
any  A  e  7  (Xq)  , 


fop>;  ( A )  ( X) 
o 


px  (A)(Y)(fo(  )) 


iff  f  =  #  (a )  ,  for  some  a  :  X  -»  ir(Y)  in  which  case  a  may  be 

arbitrary  in  general  and 

a  =  foe(X)  . 


For  any  sets  Y  and  f  :  7  )*  *  •  *x!r  (xn)  -»  T  (Y)  , 

f  is  an  n-homomorphl  sb  ire  all  marginals  f^  :  !r(Xj)  -*  9"(Y)  , 
j  =  l,...,n  of  f  are  (1-)  homomorphisms . 

(e)  Joint  distributions  and  independence. 

Suppose  £  =  (gr , e , # )  is  a  theory. 

Let  a  :  X1  x  X2  -♦  7  (Y)  be  arbitrary  for  any  sets  Xj ,  X2 ,  Y  . 
Then  define  a(1).  a(2)  :  ’’(Xj)  *  "*  bY 


a(l) 

a(2) 


(#' ( ( # (a ( Xj , • ) ) )  x  ) (A2) ) ) 


(#(  (  (#(a  ( •  ,x2)  )  ) 


A2«7 (X2) 
A1€3T(X1) 


a  ^  and  a2  are  the  natural  component-wise  ways  to  extend  a  via 

#  .  Similar  constructions  hold  for  any  a  :  X,x---xX  -♦  ir  ( Y )  .  If 

1  1  n 

a  =  ••  =  a  ,  denote  the  common  value  by  a  ;  a  is  the  basic 

( 1 )  (n) 

commutative  domain  lifting  or  extension  operator  for  a  . 

It  can  be  verified  by  use  of  (M3)  that  for  any  a  as  above, 

° ( 1 )° ( 6lXl )*e(X2 ) )  =  a ( 2 )° (e(X1 )xe(X2 ) )  =a  . 

In  particular,  consider  a  =  e(XjXX2)  for  any  sets  X1  ,  X2  . 
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Then,  if  e(i)<x1xX2^  =  e(2)*XlxX2^  ' 

i{Xj  x  X2)  :  ?(XX)  x  9-(X2)  -  tr(X1  x  X2 ) 

is  called  the  independence  map  and 

Ind{X1  x  X2)  3  {e (X1  >  X2)(A  x  B)  |  A  €  9  ,  B  «  :r(X2)) 

is  called  the  class  of  all  independent  distributions  of  X^  x  X2  . 
Manes  showed  for  any  theory  £  : 

(i)  For  all  sets  Xj  ,  X  ,  e(X1  x  X2 )  exists  and  is  thus  a 
2-homomorphism  iff 

(ii)  For  all  integers  nil,  and  sets  X1(...,X  ,  Y  and  any 

a  :  X,  x  X„  x...X  -♦  *(Y)  ,  a  :  s' ( X,  }x •  •  x*r  ( X  )  -*  ^(Y)  exists  and  is 

1  2  n  in 

an  n-homomorphism,  iff 

( iii )  For  any  sets  XQ  ,  Yq  and  all  A  e  3"(Xo)  ,  B  e  S'  ( Yq )  and 

any  set  Z  , 

Px  (A)o pY  (B)(2)o  =  pv  (B)(Z)opx  (A)o 

for  suitable  composition  arguments,  in  which  case  £  is  called  a 

commutative  theory. 

If  £  is  a  commutative  theory: 

(I)  For  any  a  :  XjX-  •  "xXn  -*  Y  ; 

aoe(X,)x->xe(X  )  =  a 
1  n 

(II)  For  n  =  2  , 

#(a)oe(X1  x  X2)  =  a 

(III)  For  n  =  1  , 

#  (a  )  =  a 

which  justifies  a  being  also  called  the  n-tiomomorph  I  c  extension  of 
a  ,  although  #(a)  is  also  an  extension  of  a  (but  not  n-homo- 
morphic  in  general). 

(IV)  The  equality  mapping  for  £  is  symmetric. 
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(V)  7  has  crisp  points  iff  for  all  sets  X  and  Y  , 

e(X  x  Y)  :  9'(X)x‘r(Y)  -♦  7  {X  x  Y)  is  injective. 

(VI)  Let  X, , . . . ,X  , Y  be  any  sets  and  let  f  :  X,x--*xX  -►  Y  , 
in  In 

nil,  all  be  arbitrary.  Then,  define  the  n-homomorphic  extension 

or  basic  commutative  multiple  domain  and  range  lifting  or  extension: 

f  :  y(X1)x..-xy(X  )  -  y{Y)  by 

't  =  (e(Y)of)"  , 

analogous  to 

y(f)  :  y(X1x-.-xXn)  -»  y(Y)  ,  defined  by 
7(t)  =  #(e(Y)of)  . 

Thus  the  operator  ^  extends  Zadeh's  fuzzy  set  extension 
principle . 

(VII)  It  follows  that  Zadeh's  fuzzy  set  theory,  finite  probability 
theory,  and  possibilistic  set  theory  as  well  as  other  theories  are 
all  commutative  theories. 


In  the  following,  let  nil  be  any  integer  and  X1,...,Xn,Y 
arbitrary  sets; 

(i)  Zadeh's  fuzzy  set  theory; 

For  all  A  €  7  (X)  ,  B  e  7  (Y) 


e(X  x  Y) (A,B)  =  A  x  B  , 

i.e.,  for  all  x  «  X  ,  y  «  Y 

4>_  (x,y)  =  min  (<t>.(x),  $R(y))  . 

e(xxY) (A,B)  *  a 

For  any  a  :  XjX  -  ■  -  xXn  -♦  ?(Y)  , 
and  any  Aj  e  ^ ( X ^ )  ,  j  =  l,...,n,  and  any  y  e  y  , 


<?_  (Y) 

ct(A. - - A  ) 

l  n 


sup  min  ($A  (Xj) . <^A  (xn),<t>  (y)) 

1  2  a<xi . V 


VV 


l  j  =  l , . .  .  , n 


and  hence  for  any  f  : 
j  =  1 ,  .  .  .  ,  n  ,  and  ar.y 


XjX  -  •  •  xXn  ->  Y  ,  any 
y  e  y  ,  letting  a 


A,  e  7(X.) 
=  e(Y)of  , 
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$  (Y)  = 

t(A1# . . . ,An) 


sup  min  ($  ( X- ) , • 

xj€  Xj.  A1 

j=l . . . . ,n, 

I  f (Xj, . • . ,xn)=y 


•'♦a  (xn)} 


(ii)  Finite  probability  theory: 

For  any  (probability  functions)  p  e  sr  (X)  and  q  e  ?(Y)  , 
and  all  x  e  X  ,  y  e  Y  , 

e(X  x  Y)(p,q)(x,y)  =  p(x)*q(y)  , 

i.e.,  e(X  x  Y)(p,q)  corresponds  to  the  ordinary  statistical¬ 
ly  independent  joint  probability  funcions. 

For  any  a  :  X1  x‘‘‘xXn  *♦  T(Y)  <  and  any  p^  e  «r(X  . )  , 

j  =  1 ..... n  ,  and  any  y  €  y  , 

“(Pl . Pn)(Y)  =  3  P1(x1)--*Pn(xn)-a(x1 . xn)(y) 

V  Xj' 

j=l , . . . ,n 

=  E(a(V1 . Vn)(y)) 


with  Vj  ,  r.v.,  corresponding  to  probability  function  p^  , 

j  =  1 . n  ;  all  Vj's  being  statistically  independent. 

Hence  for  any  f  :  XjX-^-xXj^  -*  Y  ,  and  p^  e  :r(Xj ) 
j  =  1 ..... n  and  any  y  «  Y  , 

t(pl . pn)(y)  =  I  Pi(Xi)---pn(xn) 

“  j*  A.  § 

J-l?..V,n, 

f  (Xj . xn)=y 


(iii)  Possibilistic  set  theory: 

For  any  C  e  9(X)  ,  D  «  3>{Y)  , 

e(X  x  Y) (C.D)  =  C  x  D  . 

For  any  a  :  x1x,,’xEn  -»  9(Y)  ,  and  any  C ^  e  9(Xj) 

j  =  1 , . . . ,n  , 


a(C  . C  )  = 

l  n 


W 

j=l , . . . ,n 


“(*1, • • . 'xn] 


Thus,  for  any  f  :  x1x,,'xXn  -*  Y  .  and  any  C ^  €  »(Xj) 
j  =  1 , . . . ,n  , 
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t(C1#...,Cn)  =  U  {f (Xj, . . . ,xn) }  =  f(C1x--.xCn)  . 

Xj€Cj' 
j=l , .  .  .  ,n 

(VIII)  In  particular,  consider  for  any  integer  nil, 

Booln  2  {f  |  f  :  (0 , 1 }n  -  {0,1}}  , 

the  set  of  all  classical  n-ary  truth  functions  or  Boolean 
polynomials  and  define 

Booln  3  {f  |  f  €  Booln)  . 


Hence,  any  t  e  Bool  is  such  that 

n 

f  '•  (9'({0,l}))n-»:r({0,l})  is  the  n-homomorphic  extension  of  truth 

function  f  :  { 0 , 1 } n  -*  {0,1}  . 

In  particular,  consider  the  classical  Boolean  polynomials: 


n  = 

2 

f 

=  &  : 

&(0,0)  = 

n  = 

2 

f 

*  or 

or (0,0) 

n  = 

1 

f 

=  not 

:  not(0) 

n  = 

1 

f 

■o 

n 

id ( 0 )  = 

Then  for 

(a)  Zadeh's  fuzzy  set  theory,  it  follows  for  any  nil  and  any 
f  «  B°cln  >  anY  Aj  €  *({0,1})  ,  j  =  1 , . . . ,n  , 


4>  (Y)  = 

r(Ax . An) 


sup  min  ( <t>.  (x  ) . (x  n 


.  .  .  ,xn«  {0, 1}  with 
f (xx , . . . ,xn)  =  y 


for  ye  {0,1}  . 

Thus,  for  n  =  1  and  any  A  e  *{0,1}  , 


i  .e.  , 


id(A) 
td(A)  3  A  , 


(0)  =  <*>.(0)  .  *  (1)  =  <t>- ( 1 ) 


id  ( A ) 


but  note 


$  (0)  =  <|>A ( 1 )  .  $  (1)  =  <*».  (0) 
not (A)  not (A)  A 

For  n  =  2  ,  and  any  Ax  ,  A2  «  *{0,1) 
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4>  (0)  =  max(min(4>»  {0),  4>a  (0)),  min(4>.  (0),  4>A  (1))  , 

S(A1(A2)  A1  a2  A1  2 

min(  4>,  (1),  4>A  (0)))  , 

A1  "2 


4>  (1) 

fi(A1,A2) 

4>  (0) 
or(A1 ,A2) 


=  min  (4>.  (1)  ,  4>.  (1)  ) 
A1  A2 

=  min  (<K  (0)  .  <K  (0) ) 
1  2 


4>  (1)  =  max  (min(4>.  (1),  4>  (1)),  min(<f>.  (1),  4>.  (0))  , 

or(A1,A2)  A1  a2  a1  a2 

min  (4>.  (0),  4>  (1)))  . 

A1  A2 

(b)  For  finite  probability  theory: 

For  n  =  1  and  any  p  e  fr  ( { o ,  1 } )  , 

ld(p)  =  p 

and 

not(p) (0)  =  p(l)  ;  not { p ) ( 1 )  =  p(0)  -  1  -  p(l)  . 

For  n  =  2  ,  and  any  p ,  q  «  tr  { 0 , 1 }  , 


fi(p.q)(l)  =  P'l)-q(l) 

6(p.q)(0)  -  1  -  p(l)-q(l) 

or (p,q) ( 1 )  =  1  -  p( 0 ) • q ( 0 )  =  1  -  (1  -  p(l))(l  -  q(l)) 

=  P(l)  +  q(l)  -  P(l)q(l) 
or (p,q) (0)  =  p(0) • q(0) 

-  (l-p(l))(l-q(l))  =  1-P(1)  -  q( 1 ) )  +  P ( 1 ) • q ( 1 )  . 

(c)  For  possibilistic  set  theory: 

Note  that  *{0,1}  =  (0  ,  {0},  {!},  {0,1})  . 

For  n  =  1  ,  and  any  C  e  *{0,1} 
ld(C)  =  C  , 

not (0)  =  0  ,  not ({0,1})  =  {0,1} 
not( {0} )=  {1},  not { { 1 } )  =  {0}  , 

For  n  =  2  : 

§  is  symmetric  and 

6(0,0)  =  6(0, { 1 } )  =  0  , 

$(«J,{0>)  =  6(0, (0,1))  =  §  ( { 0 }  ,  { 0 }  )  , 
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-  £{ {0 } , { 1 }  }  =  £({0}.{0.1})  -  {0} A 

£({1}.{1>)  =  {1} 

fi{{i},{0.i})  -  S( {o, 1} , {o, i) )  *  {0,1}  . 

or  is  symmetric  and 

§r(0,0)  =  or (0, {0} )  =  or(0, {0,1})  =  0 
Sr(0, {1} )  =  or { {0} , { 1 } )  »  or ( [ 1} , { 1} ) 

-  or ({1}, {0,1})  =  {1}  , 
or ({0}, {0,1})  =  or {{0,1}, {0,1})  =  {0,1}  . 

Manes  interprets: 

0  =  "undefined"  {1}  =  "yes" 

{0}  =  "no"  {0,1}  =  "maybe". 


B.  Major  issuea  concerning  Manes'  theory. 

Manes'  extremely  attractive  theory  of  "fuzzy"  theories  essen¬ 
tially  postulates  that  all  of  these  theories  (Zadeh's  system,  finite 
probability  theory,  etc.)  have  the  form  of  a  monad  and  the  associa¬ 
ted  Kleisli  category  relative  to  SET.  However,  these  results  -  as 
typical  of  a  category  theory  setting  -  depend  to  a  degree  upon 
commutativity  of  operations.  This  can  result  in  rather  restrictive 
applications  relative  to  general  fuzzy  set  theory  as  developed  here. 
More  specifically,  the  following  result  holds: 


Theorem  ( Goodman-Nguyen )  [92]. 

Let  *  =  ($not;'  4I&»  <{>or)  k®  an  arbitrarY  generalized  (fuzzy) 
set  system. 

Define  *  by,  for  all  sets  X  , 
r(X)  ^  *(X)  . 

Define  e  by,  for  all  sets  X 
e(X)  :  X  -♦  »(X) 
where  for  all  x  e  X  , 

e(X)(x)  =  {x)  . 

Finally,  define  #  by,  for  any  sets  X  ,  Y  ,  and 
R  «  5{X  x  Y)  , 

#(R)  :  ?(X)  S(Y)  , 

where  for  any  A  «  tF  { X )  and  any  y  «  Y  , 
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^(#(R)(A)(y)  ^RoA(Y) 

=  *or(WX)'  *R(X'Y)))  • 

X«X 

This  definition  is  also  motivated  by  [O,l)-logic  considerations  as 
the  evaluation 

ll  (y  «  R« A ) li 

=  it  ( ( 3 X)  ( (x  €  A)  &  {  (x.y)  €  R)  )  ) ii  , 

through  9  .  (See  Chapter  2.) 

Then  for  these  definitions  for  sr ,  e  and  #  : 

(a)  Axioms  (Ml)  and  (M2)  hold. 

(b)  Axiom  (M3)  holds 

Iff  <}>&  is  right  distributive  over  4»or  ,  i.e. 

^&(a'  *or<b,c))  =  ^or ( { a ' b }  '  $&{a,c))  : 
all  a , b, c  «  [0,1]  (2) 

Iff  composition  °  is  associative 

So(Rop)  =  (SoR)op  ;  all  R,  S,  P, 


where 


P  €  s(W  x  X),  R  «  ?(X  x  Y),  S  e  *(Y  x  Z)  ; 


all  sets  W,  X,  Y,  Z  ,  where  composition  is  typically  given  in  eg. 
(1)  ,  in  which  case  J  =  (:r,e,#)  is  a  theory  in  Manes’  sense. 


Proof  (2) :  For  any  sets  X,  Y,  Z  ,  for  any  R  «  ?(X  x  Y) , 
S  «  s(Y  x  Z),  A  €  ?(X)  ,x€X,yeY,z€Z: 


.  ■  *or<  fS<V.  =  ) ) )  . 

left  hand  side  of  M3) 


(3) 


(right  hand  side  of  M3) 


RHS ( M3 )  =  ($-oR(x,z)  .  4>.(x)))  , 


ror '  T& '  TS°R 

X6X 


(4) 


where  is  9*ven  e<3(l)  and 

*S°R{x'z)  =  ^or(^(^R(x'y)'  *s(y'z)))  • 
y«Y 

If  <t.  is  right  distributive  over  <6  _  ,  then 
&  Tor 


(5) 
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<t>&(tRoA(Y)  '  $S(Y»2))  =  4,or(‘t'M(x,Y,z)  )  (6) 

X€X 

where 

<t>M(x.y,z))  3  '  <I>r(x*Y)»  <l>s(y»z))  ,  (7) 

and 

^&(^SoR(x'z)  '  ^a(x))  =  <*lor(^M(x'Y'z)  )  •  (8) 

yeY 

Hence,  substituting  (7)  into  (3)  and  (8)  into  (4)  yields 

LHS(M3)  =  $or(*M<X»Y.z) )  =  RHS (M3 )  . 

xeX, 

y«y 

Conversely,  if  Axiom  (M3)  holds,  choose  A  -  X  =  (x^Xg)  , 

y  =  (y>  »  ^r(xi *y)  =  b  •  ^r^x2'y^  =  c  '  ^s^Y'z^  =  a  '  otherwlse 

arbitrary.  Hence  eq.(2)  holds. 


As  a  consequence  of  the  above  theorem,  since  as  a  check, 

Zadeh's  system  =  ( l-( • ) , min, max)  and  9'  =  ( l-( • ) , prod, max) 

are  always  right  distributive,  both  satisfy  (Ml),  (M2)  and  (M3) 

and  yield  Manes'  theories.  On  the  other  hand, 

=  ( l-( • ) ,prod,probsum)  ,  9"  =  ( 1- ( • ) , prod , minsum)  and  many  other 

systems  such  as  non-Archimedean  semi-distributive  ones  are  in  gen¬ 
eral  not  right  distributive  and  hence  are  not  compatible  with  Manes' 
theories.  The  latter  class  of  systems  plays  a  key  role  in  the 
weak-homomorphic  approach  to  fuzzy  set  systems  (see  Chapter  6). 

Although  the  above  theorem  shows  composition  with  respect  to 
binary  fuzzy  relations  for  general  fuzzy  set  systems  is  associative 
iff  right  distributivity  holds,  it  is  always  true  that  functional 
transforms  for  general  fuzzy  set  systems  are  always  associative 
since  the  <f>&  operator  is  not  used  here: 

For  any  f  :X-Y,g:Y-Z,h:Z-*W,X,Y,Z,W  any 
sets,  associativity  holds 

h<>  ( go  f )  =  ( ho  g )  o  f  , 

where,  e.g.,  for  any  A  «  9 {X)  ,  y  e  Y  , 


*f (A) (Y)  ~  *or 

x€f-1(y) 


(4>a(x)  ) 


and  any  z  «  Z  , 

*(S°f ) (A) (Z) 


*or  ,  Vx) 

x«  ( go  f )  1  (  z ) 


■  $  (<t>  i>_  (x)  ) 

x«  f  ~ 1 ( y )  yeg_1(z) 

=  <f*g(f(A))(z)  • 
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etc  . 

A  basic  question  then  is  to  determine  which  of  Manes'  develop¬ 
ments  are  compatible  with  general  fuzzy  set  systems:  in  particular, 
when  can  axiom  (M3)  be  eliminated? 

Eytan  [57]  (despite  his  mistaken  idea  that  Fuz(H)  is  a  topos 
-  see  [29]  or  [204])  compares  Manes'  approach  with  his  own  which 
utilizes  Coste's  construction.  (See  also  section  2.4.)  Both  ap¬ 
proaches  rely  on  commutativity  of  basic  system  operators,  which  as 
stated  before,  may  be  somewhat  restrictive.  Further  effort  must  be 
expanded  in  comparing  Manes'  approach,  Fuz(H)  and  Higg(H)  (see 
section  2.4)  and  in  determining  what  extensions  to  general  fuzzy  set 
systems  are  possible. 

Next,  suppose  that  Manes'  theory  is  replaced  by  the  following 
generalized  logical  system  type  theory. 

9"  :  SET  Ar(SET)  attribute  or  dispersion  operator, 

e  :  SET  -♦  Ar(SET)  imbedding  operator, 

tr  :  Ar ( SET)  -*  Ar(SET)  lifting  operator  (modified) 

are  such  that  for  any  X,  Y,  Z  «  SET  ,  T(X),  :r(Y),  9"(Z)  *  0  : 

(a)  e ( X)  :  X  -  7(X) 
where  for  any  x  «  X  , 

«x  =  ( e(X)  )  (x)  €  9-(X)  S  (0,1}X  . 

(b)  n(YX)  :  YX  -  sr(Y)3’(X) 

V 

where  for  any  fey  ,  slightly  abusing  notation, 

n(f)  =  (tt(YX))(f)  . 

(c)  For  any  <}>A  «  tr  ( X )  and  x  e  X 

(tt(e(X)  )  ( <t>A ) )  («x) )  =  4>A(x)  . 

(d)  For  any  f  «  YX  ,  x  e  X 


(e)  Recall  that  id  :  SET  -•  Ar(SET)  where  for  any  X  €  SET  , 
id  (X)  :  x  -*  X 
where,  for  any  x  e  X  , 

id(X) (x)  =  x  . 


Then 
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ft(  id)  (X)  =  id{9"  (X) )  . 

(f)  For  any  x  e  X  and  any  «  3"  ( X )  , 

(TT(ax)(V)(1)  =  ^A(x)  • 

X  Y 

(g)  For  any  f  e  Y  ,  g  «  Z  noting  the  composition 

Tt(g°f)  =  tt(g)  0  tt(f)  . 
implying  it  is  a  functor. 


Examples . 

Ex.  1.  7=7  (i.e.,  no  restriction). 

Pick  *  =  (4>„ot.  4>&,  4>or)  • 
Define,  for  any  x,y  e  X 


(a*  ) 


a  (y)  *  a  (Kronecker  delta) 

x  x  #  y 

and  for  any  f  e  YX  ,  4>A«5{X)  ,  ye  Y  , 


(b '  ) 

i.e., 


( tt ( f )  (4>a)  )  (y)  3  4>or  (tA(x)) 

xef_1(y) 


rr(f)  =  f  (relative  to  $Qr)  . 


Then  for  any  4>A  e  3"(X)  ,  x  e  x 


(c1)  (tt(e(X)  )  (<{>A)  )  (oA) 
=  <P 


or  ,  <Va)) 

(0€e(X)-1(«x) ) 


since  e(X)(x)  =  5^  ,  uniquely. 


( d 1 )  For  any  f  «  YX,  x  e  X,  y  e  y 

(tr(f )  (ax) )  (y) 

=  <t>  (a  (s) ) 

Yor  '  x'  ' 1 

««f  (y) 


gof  «  ZX 
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♦or ( ‘ 

.  1,  • • 0) 

=  1  ,  if  x  e  f_1 (y) 

♦or ( ' ‘ ‘ 

.  0,-  •  ,0) 

=  0.  if  x  e  f_1 (y) 

ftx)(y>  • 

(e')  For  any  $A  e  s(X)  ,  x  e  X  , 

(tt(id)  (X) )  ($A) )  (x) 

=  ♦or  <$A(s)) 

(s«id(X)~1(x) ) 

=  4»a(x)  . 

since  x  =  id(X) -1 (x)  ,  uniquely. 

(f ' )  For  any  $A  «  s (X)  ,  x  e  X  , 

(tt(«x)(<f>A))(l)  =  *or  («t>A(s) ) 

(sea_1(l) ) 

=  VX)  ' 

since  ax(x)  =  1  uniquely. 

Note  also  here  that 

(tT{ax><V><°>  =  <i>or_  ( 1>A  ( s  > ) 

seax1(0) 

=  ♦or 

(seXH {x} ) 

(9’)  For  any  f  €  YX  ,  g  e  ZY  .  ^  e  <r (x)  ,  z  e  2  , 

( n ( go f ) )  ( <t>A)  (z) 

^or  =  ♦or  (<}>A(s)) 

s« ( go  f )  (z)  sef-1 (g-1(z) ) 

=  V  (V_  ♦A(s))  =  (tt(g)  (tT(f)  (A)  )  (t) 

teg  (z)  sef  1 ( t ) 

provided  ♦qj.  is  associative. 

Ex.  2.  O'  (X>  =  0  if  X  a  R  ,  etc. 

For  7(X)  *  0  ,  let 
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y(R)={F|F  isac.d.f.  over  R)  . 

Let  *not  =!-(•) 

=  r-copula,  etc.,  by  Slclar's  theorem  (2.3.6). 

Define,  for  any  x,y  «  X 

6X(Y)  =  HX(Y)  (heavyside  function) 

_  f  0  iff  y  <  x 
1  1  iff  y  i  x 

and  for  any  f  e  YX  ,  F  e  :r{X)  ,  y  «  Y  , 

(tt(  f )  (P) )  (y)  *  pFof_1(Ry_) 

where  is  probability  measure  over  B  determined  by 

c.d.f.  F  and  R  ^  [a,+«)  (right  ray),  etc. 

cl 

Then 

(c'*)  For  any  F  e  «r(X)  ,  x  e  X  , 
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( f  ’  ' )  For  any  F  e  ?(X)  ,  x  e  X  , 

(tt(Hx)  (F)  )  (1)  =  (R1_(in{0,l) ) ) 

=  yh^o) 

=  ^F(-«,x) 

=  F ( x)  . 

Note  that 
(n(Hx) (F) ) (0) 

=  A/FoH~1(R0_(in{0<l}  )  } 

=  tJFW~l{0)  ) 

=  ) 

=  0  . 

{ g '  ’  )  For  any  f  «  YX  ,  g  €  ZY  ,  F  e  *r  (X)  ,  z  e  Z  , 

rr(g°f )  (F)  (z)  =  /JF«(gof)-1(Rz)  =  (MF(f"1(g"1(R2) ) ) 

«  (fj  (g-1 (R  ) )  «  rr(g)  itr(f)  (FJ  )  (z)  =  n(g)°tr{f )  (F)  (z) 
Tt(f) 

Ex.  3.  Manes'  system  revisited. 

Suppose  once  again  (tr,e,#)  is  a  given  Manes  t  litrui'y  . 

Thei.  define  tt  by,  for  any  X,Y  «  SET  anu  f  e  yX  , 

<PA  «  tr  (X)  , 

tr(  f )  =  a-(f)  =  (ftl^lKfj  =  #(e(Y)of)  . 

Cons 3 der : 

(d'")  tKf)(«x'  =  (#(e(Y)«f))(flx)  =  (e(Y)of )  (x)  , 
by  (Ml) 

=  af  (x)  . 

( e ’ ’ • )  n( id) (X)  =  #(e(X)°id(X) )  =#(e(X)) 

-  id (7 (X ) )  , 

by  (M2) 

( g  ’  1  ’  )  it  ( g°  f )  =  #  ( e  ( Z )  °  go  1  ) 

Now 

+t(g)°rt(f)  =  #(e(Z)«g)o#(t(Y)of )  =  #,#(e(Z)°g)<>e(Y)of)  , 
by  ( M3 ) 
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=  #((e(Z)og)of)=fSof)  , 

using  (M3)  and  (Ml)  . 

In  general,  c)  and  f)  from  part  (2)  will  not  be  satisfied 
for  Manes'  theory. 

However,  if  we  define  #  by  , 

For  any  $R  :  X  -  5T(Y)  ,  #($R)  :  ar(X)  -  ft  Y) 
where,  for  any  4>A  e  f  (X)  and  y  e  Y  , 

(  (#(1>R) )  (4>a)  )  (Y)  =  *or<V,>A(s)  '  (Vs))(y)) 

seX 

and  hence  for  any  f  :  X  -♦  Y  ,  41A€3'(X)  ,  ye  Y 

(ft(f)mA))(y)  =  ( #  ( e  ( Y )  o  f )  )  { $A )  (y! 

=  *0r(*&(*A(s)  ’  e(Y)  {f(s)  )  >  (Y>  >  ) 
s«X 

"  *or  <VS»  • 

sef  1  ( y ) 

(Note:  e(Y) (f (s) ) (y)  =  af(s)(Y>-> 


(  f  1  ’  ’  )  For  any  <f)  e  *  (X)  ,  x  «  X  , 

(tt(ax)  (<PA) )  (l) 

=  (  (#(e  ( (0,1)!°8X)  )  (<|>A)  1  (1) 

=  4lor(<,,a(*A(S)  '  <e(<0.l))otfx)(8)(l))) 

sex 

a,  for  s  =  x 

■  ‘WW5”  •  8n  for  s  .  x  111  'I 
seX  L  0  I 
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=  <J>A(X'/ 

(Note:  (e(sr  (X)oe(X)  )  (s)  =a5  and  (e<7(X) )»e(X) ) (s) ) (S  )  =  «s  •  ) 

s  ' 

In  addition,  #  as  defined  above  also  is  such  that: 

For  (Ml)  :  For  any  a  :  X  -*  7  (Y)  and  any  x  e  x  , 

( (#(a) )oe(X) ) (x)  =  (#(a))(ax) 

=  <flor(^&(0x(s)  '  a(s)  ( '  > )  > 
sex 

=  a(x) (•  ) 

Thus  (Ml)  holds. 

For  (M2)  :  For  any  $Ae  7{X)  ,  x  e  X  , 

(# (  e  ( X)  ) )  ( <t> A )  (x) )  =  <t>or(<t>&(<fA{s)  ,  e(X)  (s  )  (x)  )  )  =  ^(x)  , 

S€X 

Thus  (M2)  holds. 

In  addition,  note  again 

(i)  If  :  X  t  7  (V)  is  such  that  3  f  :  X  -*  Y  where  for  all 

x  €  x  ,  y  «  Y  , 

(<t>R(x)  )  (y)  =  Syf  f  (X)  < 

then  for  all  $A  e  7(X)  ,  y  «  Y  , 

(#(4>R)  )  (<t>A)  (Y) 

=  (V*)} 

x«f_1{y) 

=  (tr(f )  (<|>A) )  (y)  . 

(ii)  #  as  defined  above,  satisfies  (M3)  iff,  assuming  for  <f>&  , 
$or  -  is  right  distributive  over  <$>  (such  as  (min, max)  or 

(prod,  bndsv.m)  ,  in  which  case  fuzzy  composition  *  is  defined  by 
4>R  :  X  -  *(Y) 

<fs  :  Y  -  7(Z) 

$S,R  :  X  7(2)  , 

where  for  any  x  e  X  ,  z  e  z  , 

(4's.R<x5  )  (z)  “  1>or(*&U1>R(x>)(y).  («J»s(y))(z)) 
ye  Y 

=  (  (  (#(<t>s)  )°<?>R)  (x)  )(z)  , 
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^S*R 


#(^S)0^R  ’ 


and  *  is  associative. 


C.  A  ■odification  of  Manes'  theory. 

Many  of  Manes'  Fuzzy  Theory  concepts  simplify  when  the  lift¬ 
ing  operator  relation 

(1)  f  :  X  -  :r(Y)  to  (#)f  :  JT(X)  -»  T(Y) 
is  replaced  by  the  relation 

(2)  f  :  X  -*  Y  to  tT(f)  :  !T(X)  7(Y)  . 

For  example,  for  Manes'  axiom  (M3)  ,  an  associativity  form  of 

#  is  actually  quite  stringent.  It  was  shown  previously  that  indeed 
a  general  truth  functional  fuzzy  set  system  satisfies  (M3)  iff 

is  right  distributive  over  $or  (and  hence  includes  Zadeh's  origi¬ 
nal  min-max  system) .  On  the  other  hand,  it  is  shown  in  this  section 
that  with  the  relation  (2)  ,  (M3)  can  be  replaced  by  a  simple  asso¬ 
ciative  form  which  holds  for  all  truth  functional  fuzzy  set  systems 
as  well  as  probability  measure  systems. 

Some  additional  modifications,  in  addition  to  those  already 
presented  in  this  section  include: 

1 .  Theory  »aps: 

A  :  (y  ,e,tT)  -♦  (y  ’  ,e'  ,tf' ) 

is  a  theory  map  iff  for  all  X  €  SET  . 

(i)  A  ( X )  :  T(X)  -  tT'(X) 

and  for  all  X,Y  e  SET  and  f  :  X  -»  Y  . 

(ii)  A(X)(«  )  =  O'  ,  for  all  x  «  X  , 

(111)  tt  *  (f  )oA  (X)  =  A  (Y)-ft(f  )  . 

In  a  quotient  theory,  each  A (X)  is  required  to  be  surjective. 
If  (9"  , e , TT )  is  a  theory  and  for  all  X  €  SET  , 

a  (X)  :  tr(X)  -♦  HX  is  arbitrary,  first  define  *'(X)  by  (i),  for  all 
x  «  X  ,  o  ■  by  (ii),  and  define  rr  *  by  (iii).  Thus,  (ff.e.tt) 

and  a  always  induces  a  unique  quotient  theory,  where  a  is  a  sur¬ 
jective  theory  map  to  the  quotient  theory. 

In  particular,  if  (ff.e.rr)  is  a  theory,  and  for  all  X  e  SET  , 
e (X)  is  an  equivalence  relation  on  ^(X)  ,  then  define 
a  { X )  7(X)  -•  tr  (X)  /e  (X)  a  surjective  map  which  yields  a  quotient 
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theory  symbolized  as  (3/e,  e/c ,  rr/« )  .  with  A  the  theory  map. 

In  a  subtheory,  each  a(X)  is  required  to  be  injective.  Then 
the  converse  of  the  above  holds: 

If  (3',e,,TT')  is  a  theory  and  for  all  X  e  SET  , 
a  (X)  :  3(X)  -*  3'(X)  is  injective,  then  there  is  a  unique  way  to 

define  theory  ( tr ,  e , tt )  such  that  A  is  an  injective  theory  map 
from  (sr,e,n)  into  ( tr  ■  ,  e  *  ,  rt  •  )  .  In  particular,  given  any  theory 

(^'.e'.tt')  and  for  each  X  e  SET  ,  define  sr(X)  £  ^'(x)  ,  then  the 
mapping 

A  (X)  :  T  (X)  3  (X)  £  ST  •  (X) 

with  e  and  Tt  cut  down  to  3  ,  yields  an  imbedding  of  ( tr ,  e ,  ft ) 

into  (3',,e,,rt')  uniquely. 


2 .  Membership  functions  and  extensions. 

(a)  Consider  first  any  f  :  X  -*  {0,1}  ,  f  e  o'  ( X )  .  Then 

TT (  f  )  :  3(X)  -  9*  (  { 0 , 1  }  )  . 

In  particular,  for  any  set  C  e  »(X)  ,  if  $c  e  7(X)  ,  then 

TT ( 4>c )  may  be  considered  the  natural  extension  of  and  hence  of 

C  .  Note  the  range  of  Tt(f)  is  3{{C,1})  not  just,  e.g.,  3"  (  { 1 } )  . 

(b)  Similarly  for  any  f  :  X  -*  H  ,  f  e  3(X)  ,  TT(f)  :  3(X)  -»  3(H) 
may  be  considered  the  natural  extension  of  f  and  hence  letting  f 
=  4>a,  A  e  3(H),  rr ( <J>A )  is  the  natural  extension  of  A  .  Again,  note 

the  range  of  TT  { f )  being  3(H)  not  just  3  ( 1 )  ,  e.g. 


3 .  Equality  maps. 

Note  that  classical  equality  between  points  in  a  space  X  is 
given  by  as  the  diagonal  relation 

"="  s  X  x  X  , 


and 


<t>_  :  X  x  X  -*  {0,1}  , 

where 

*=(x.y)  =  1  '  lf  x  “  * 

.0  ,  if  x  *  y 

and  equivalently  the  associated  function  is 
id  :  X  -  X  . 


Then  using  approach  2  to  extensions,  as  presented 
2.3.7,  it  follows  easily  that 


in  section 
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$_  :  3-(X  x  X)  -*  7  {0,1} 


where  for  any  4>,  «  9"  (X  x  X)  ,  4  e  {0,1) 


4> 


(<)  =  (  t  (<j>A)  )  (  (2) 
zeXxX 

=(z)=4 


fzeXxX  I 

l=(z)=«J 


,  if  4=1 


*  <«,A({x'x)x€X) 

X€X 

f  „  V^'^x.yeX*  '  if  *  =  ° 
x:y«x  x4 

x*y 


B  e  3 (X)  ,  yields 

<Mx.x)  = 


whence 


^  {y }  X<^B ) 


(O  = 


A 

=  <?> 

{y)  X  *B 

for 

/ 

if 

x  =  Y 

« 

if 

x  *  y 

• 

(Y) 

for  4 

=  1 

0 

for  4 

=  0 

an  interesting  check  that  membership  should  be  a  special  case  of 
equality . 

More  generally,  note  the  case  ^a  ~  x  ^B  '  for  C,B  e  1 

does  not  really  simplify,  except  that  it  extends  Manes’  equality  map 
specialized  to  Zadeh’s  min  -  max  system. 

Alternatively,  if  approach  1  to  extensions  (section  2.3.7)  is 
formally  applied  to  <}>_  ,  then  for  any  A  e  g"(X  x  X)  , 

4U  (O  =  V(^=)_1{<)  ' 

yielding  a  similar  result. 

On  the  other  hand,  if  we  consider  id(X)  :  X  -*  X  ,  it  follows 
immediately  that  TT(id(X))  =  id(?  (X)  )  , 
i  .  e  .  , 


4U 

id  ( X) 


"  ♦id(9T(X))  • 


Note  that  Manes  considers  as  basic,  the  asymmetric 
procedure  f  :  X  -*  tr(Y)  extended  to  #(f)  :  3"  (X)  ■*  7  (Y)  . 

we  have  considered  the  full  extension  ft(f)  :  7  (X)  -*  7  (Y) 
f  :  X  -♦  Y  .  However,  note:  if  f  :  X  -»  7  (Y)  then 


extension 
Instead , 
from 


TT(f )  :  T(X)  -  7(7(Y)  )  , 


by  the  usual  extension.  Let  g  €  J(X)  .  Thus, 
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(TT(f))(g)  «  3"  (JT  ( Y )  )  and  hence  («(f))(g)  :  sr  (Y)  -*  H  . 

In  particular,  for  any  y  €  y  ,  cy  e  e(Y)  £  7 (Y)  .  Thus, 


(  ttt(f ) )(g)) (a* )  :  Y  -  H 


where  for  each  y  «  Y  , 

( (tt(f ) )  (g) )  (a-  >  ( y)  =  ( (tt(f))  (g) )  (a  )  . 

Hence,  in  general,  (1t(  f )  )  (g)  )  (8  •  )  «  3"  ( Y )  .  Thus,  for 
f  :  X  -♦  3"(Y)  ,  we  define: 

#(f)  :  3"  (X)  -  ff(Y)  . 

where,  for  any  g  e  3"(X)  and  y  e  Y  , 

<#(f))(g)(Y)  =  ((tt(f))(g))(8y)  . 


Thus,  #  as  used  here  corresponds  to  Manes'  #  . 
Furthermore,  note  that  #  is  an  extension,  since,  for  any 
x  e  X  and  hence  °x€  e(X)  £7(X)  ,  y  €  Y 

(#(f))(«x)(y)  =  (tr(f)(ax))(ay)  =  °f(x)(ay)  • 


Define 

5(X)  :  X  -♦  7(X)  ,  where  for  any  X  €  X  ,  e(X)  (x)  -  . 

Then  consider  Manes'  axiom  (Ml)  : 

For  any  a  :  X  -•  5J"(Y)  ,  y  e  Y  , 

( # (a )°e (X) ) (y) 


=  ( (n(a) }°e(X) (x) ) (ay)x€X 

=  <wa)i°V(Vixa 


^  ^Ba (x) ^  ^°y ^ *x«X 


*  a 


1  iff 
0  iff 


a(x)  = 
a (x)  *  0 

Y 


Next  consider  Manes'  axiom  (M2): 
For  any  £  e  7(X)  ,  x  €  X  , 

( (#) v  e (X) ) ) (g) ) (x) 

=  <(tt(e(X)))(g)(Ox) 

2  {(tT(V  Vx(g)){V 

s  g(x)  . 

Hence,  (#)(e(X))(g)  =  g  ,  i.e.. 
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#(e(X))  =  idg.{x)  . 

Finally,  it  should  be  remarked  that,  in  general,  we  cannot 
derive  Manes'  axiom  M3  for  #  .  Indeed,  for  the  truth  function¬ 
al  system  ( 4>nt ,  $or)  : 

For  any  a  :  X  -♦  3"{Y)  ,  /J  :  Y  -*  T(Z)  ,  for  all  f  e  7(X)  , 

Z  €  Z  , 

(  (#(/i)o#(a)  )  (f)  )  (z) 

=  *<*)) 
wa-M*  )1 

-!  Y 
X€/5  (flz)J 

On  the  other  hand,  it  can  be  shown 
((#(#(/J)oa))(f))(z) 

=  4>or(f(x)) 

x  €  a-1(  (#(/J)  )_1(0Z)  )  . 
where 

(#(/j))-1(s  )  =  {g  |  g  €  t(Y)  &  for  all  z'  e  Z  ,  z'  *  z  , 
z 

(g  I  P'Vj.*-  ))  =  0  &  <J>or  (g(y)  )  =  1} 

(y€/j_1(sz)  ) 

a  a  <»y  i  y  *  • 

Hence  using  the  mor.otonic  property  of  t-conorms: 

If  C  £  D  ,  then  4>or(h(x))  2  4lor(h{x))  ,  it  follows  that  for  all 

x«  C  xe  D 

f  €  9T  (X)  ,  Z  €  z  , 

((#(/>)«*#(«))  m)  <z)  2  (  (#(#(p)oo)  )  (f  )  )  (Z)  , 


with  strict  inequality  holding  in  general  unless 

(#(P))_1(az)  =  e  . 

z 

Furthermore,  as  mentioned  previously,  a  necessary  and  suffi¬ 
cient  condition  for  Manes  axiom  (M3)  to  hold  for  the  interpretation 
of  #  ,  given  in  section  B  and  below,  is  that  the  truth  functional 
system  ^not*  '  $or)  be  such  that  <|>&  is  right  distributive 

over  4 

Tcr 

Again,  conversely,  let  (tr.e,#)  be  a  given  Manes  theory.  Then 
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define  operator  ft  by,  for  any  sets  X,Y  and  f  :  X  -*  Y  ,  first 
noting 

e(Y)of  X  -  e(Y)  £  !T(Y)  , 

and  hence 


nit)  :  T(X)  -  !T{Y)  , 


where 

tt(f)  3  # { e ( Y)o f )  . 

It  follows  readily  that  all  of  the  axioms  we  require  for  ft 
(and  7  and  e)  to  satisfy  for  a  general  logical  system  indeed 
hold ! 

Thus,  Manes'  theory  nay  be  considered,  in  so  far  as  the  lifting 
operator  #  is  concerned  (in  conjunction  with  sr  and  e) ,  a  some¬ 
what  more  restricted  logical  system.  Hence,  we  cannot  expect,  in 
general,  certain  of  his  results  to  be  applicable  to  general  logical 
systems.  In  particular,  it  can  be  shown  that  Manes'  concept  of  in¬ 
dependence  of  distributions  and  homomorphisms  do  not  apply  here,  in 
general.  However,  consider  the  following  example: 

As  an  illustration  of  the  previous  point,  consider  Manes' 
system  as  applied  to  Zadeh's  min-max  system  and  its  natural  general¬ 
izations  : 

For  any  f  :  X  -  tr(Y)  define  #{f)  :  y(X)  -  :r(Y)  by,  for  all 
g  «  tr(X)  and  all  y  e  y  , 


( (#(f ) )  (g)  Hy>  «  sup  (min  (g(x),  f(x)(y)))  . 

X€  X 

More  generally,  for  any  truth  functional  system 
(*nof  V  $or>  coru^der 

( (#( f ) )  (g) )  (y)  ^  (4»- (g(x) »  f  (x)  (y ) ) )  . 

or  cx 
X€X 

Now,  #  is  a  legitimate  lifting  operator  in  Manes’  sense, 

except  that  axioms  Ml,  M3  are  violated,  unless,  as  stated  before 

is  right  distribution  over  <f>  -  which  is  the  case  for  Zadeh's 

«  or 

min-max  system.  (Again,  see  subsection  B.) 

Then  define  ft  by,  for  any  f  :  X  -*  Y  , 

ft(f)  =  #(e(Y)of)  ;  ft(  f )  :  «r(X)  7(Y)  . 

Thus,  for  any  g  e  tr  f X )  and  y  e  Y  , 

(ft(f) ) ( g ) ) (y)  =  ( (#(e(Y)of ) ) (g) ) (y) 

°  ^or^Sl9'x>'  (e(Y)of )  (x)  (y)  ) ) 
x«X 

=  <?>or(^(g(x).  «f(x) 

x«X 


(Y)  )  ) 
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=  <?>or  (g(x))  . 

xef_1(y) 

Thus  indeed,  rt  as  defined  above  coincides  with  ff  as  pre¬ 
viously  defined  for  truth  functional  systems!  Hence  the  above 
alternative  definition  for  tr  for  truth  functional  systems  is 
justifiable. 

Analogous  to  Manes'  development  of  the  concept  of  independence, 
first  define  the  following  natural  extensions  of  any 

f  :  X  x  Y  -♦  3{Z)  by  use  of  #  or  some  equivalent. 

Define  variables  by 

•  «  X,  ••  e  Y,  ...  €  z,  e  s(x),  e  s(Y)  . 

Then  with  obvious  notation,  for  any  x  €  X  , 
f(x,.*)  :  Y  -*  s(Z)  ,  and  hence  #•  •  f(xf«.)  :  sr(Y)  -♦  s'  { 2 )  ,  and  hence 

•  f  (•  ,•  •  )  :  X  -  3-(Z)9'(Y)  , 

equivalently 

r1(t)  3  (((#*.f(.  ,-•)(•))(;;.  )  :  X  3(Z)9'(Y)  , 

where  for  any  g  €  3(Y)  , 


({#••  f  (.,••)){.))  (g)  :  X  -  3(Z>  . 

Thus , 

#{ri(  f))  =  (#(((#• .f(.  ,••))(•))(;:.  )))  :  y(X)  -  3'(Z)3'(Y) 
yielding  equivalently 

j  (  f  )  )  :  3"  (X)  x  T  (Y)  s'  (  Z )  . 

Similarly  by  reversing  first  y  for  x  , 

*(r2(  f )  )  :  s  { X )  x  3-{Y)  -  3(Z)  . 

In  particular,  letting  f  =  i(X  x  Y)  :  X  x  Y  ->  3(XxY)  ,  if,  as 
before 

#(r(x  x  Y))  =  #(r1(l(x  x  Y)))  =  #(r2(e(X  x  Y) ) )  , 

for  all  sets  X  and 

#(r  (X  x  Y))  :  3(X)  x  3  ( Y)  -  3  ( X  <  Y) 


is  the  independence  mapping. 
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It  can  be  verified  that  for  #  as  defined  previously  for  truth 
functional  system  ($not'  '  the  indePendence  mapping  exists 

and  is  given  by,  for  all  f  e  3"(X)  ,  g  e  sr  (Y)  and  all  x  e  X  , 

y  e  y 

((#(r<x  x  Y) ) }  (f  ,g)  )  (x,y)  =  <t>& ( f  (x)  ,g(y) ) 

=  ( f  x  g) (x,y)  . 


Also,  one  can  define  for  any 
a  :  X2  x  X2  -  fT(Z)  , 

#(a)  :  3'(X1  x  X2)  -  a-(Z)  , 

as  usual ,  and  assuming 

r(Xx  X  x2)  :  3'{X1)  X  «r(X2)  -  7(XX  X  X2) 


exist , 


a  ^  (#(a))«r(X1  x  X2)  :  7  x  sr(X2)  -  7  (Z)  , 
and  it  follows  that  #(a)  and  a  extend  a  : 


aoe(X1  X  X2)  =  a  =  #(a)oe(X1  x  Xg  )  . 

A  natural  question:  What  is  the  relation  between  a  ,  #(a) 

and  # (r (a) )  =  #(r^(a))  =  #(r2(a})  ,  when  the  latter  exists?  In 

Manes’  theory,  It  follows  by  uniqueness  of  2-homomorphic  extensions 
and  the  construction  of  r(a)  (see  his  theorem  5.7  [169]),  that 

a  =  r(a)  .  However,  due  to  the  failure  of  axiom  (M3)  to  hold  in 

general,  each  a  need  not  be  a  2-homomorphism  in  Manes’  sense,  the 
multiple  factor  generalization  of  a  function  arising  as  an  extension 
through  #  ;  although  when  /"(a)  is  well  defined,  the  latter  clear¬ 
ly  is  such  an  extension.  Thus  also,  Manes'  characterization  (again 
see  Theorem  5.7  [169])  of  universally  commutative  Y-ary  operations 

being  equivaler. :  to  unique  multi-homomorphism  extensions  fail  here. 

Again,  two  important  special  cases  should  be  mentioned  where 
all  three  Manes  axioms  hold  and  hence  where  all  of  Manes'  results  on 
n-homomorphism  extensions  and  commutative  systems,  among  others, 
hold  are: 

(1)  Zadeh's  fuzzy  set  system  (1  -  (• ),nin,max)  where  for  any  X  , 
T(X)  =  [0,1]X  ,  U(X)  =  X 

e(X)  :  X  -»  !T(X)  ,  where  for  all  x,y  e  X  ,  e(X)(x)(y)  =  6  , 

x ,  y 

and  for  f  :  X  -♦  7  (Y)  ,  for  all  g  €  ( X )  ,  y  «  Y  , 
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(  (#(f ) ) (g) ) (y)  =  sup  min  ((f(x))(y),  g(x))  . 
xeX 


(2)  Discrete  probability  theory:  the  same  as  the  fuzzy  set  system 
(!-(•),  prod,  bndsum)  where 


0,  if  X  is  not  discrete  , 

class  of  all  prob.  fucntions  over  X  , 

if  X  is  discrete 


e(X)  :  X  -•  ir  { X )  ,  where  for  all  x,y  e  X  , 

(e(X) (x) ) (y)  =  sv 

A  /  y 

and  for  f  :  X  ->  «r  (V)  ,  for  all  g  €  g'(X)  ,  y  €  y  , 

( (#(f) ) (g) ) (y)  =  *  (f (X) ) (y)-g(x)  =  E_{ f (*)(*•) )  • 

xeX  9 

Note  that  both  examples  above  are  special  cases  of  the  system 
l^not'  $0r*  '  *  defined  as  before,  where  in  (1) 

<t>&  =  min,  <f>or  =  max  and  in  (2)  <f>&  =  prod,  $Qr  =  bndsum  .  Note 

also  that  for  both  systems  -  as  a  check  with  the  characterization  of 
such  Manes1  truth  functional  systems  -  have  right  distributive 

over  <t> _ . 

T  or 

So  far,  we  have  considered  the  extension  operations  rr  , 
lifting  functions  f  ;  X  -♦  Y  to  Tl(f)  :  3(X)  -•  3  (Y)  and  #  lifting 
functions  a  :  X  -»  s  (Y)  to  #(a)  :  3(X)  -♦  3(Y)  and  the  imbedding 

operation  e  :  X  -*  e(X)  £  3"(X)  . 

The  question  then  arises  as  co  the  converses  of  these  oper¬ 
ations  . 

For  example,  consider  the  converse  of  the  imbedding  operation 
e  .  Can  we  obtain  a  natural  "subjection"  (in  order  to  avoid  con¬ 
fusion  with  "projection")  S,  where  for  all  X  ,  and  given 
U(X)  £  »(X)  ,  S(X)  :  a-  (X)  -»  U(X)  is  surjective.  Watanabe’s  exampl 

[267]  furnishes  such  as  mapping,  when  U(X)  =  3(X)  ,  ^ ( X )  =  [0,1] 

and  under  S(X)  ,  any  f  e  ar(X)  is  pushed  to  an  image  representing 
the  membership  function  of  an  ordinary  set. 

In  turn,  any  mapping  such  as  h  :  s'  ( X )  ->  3  ( V )  may  be  reduced 
in  two  easy  stages,  using  again  the  notation  e(X) (x)  =  ,  for  all 

X  €  X  , 

(i)  s ( h )  :  X  -  3(Y)  , 

where  for  any  x  e  X  ,  s(h)(x)  -  hjs^)  . 

(ii)  t  (h)  :  X  U(Y) 

where 

t  ( h )  =  s  ( Y ) o  s  { h)  . 


3  ( X )  = 


4)  X 


Some  Theories  of  Uncertainty 


459 


Note,  of  course,  the  compatibility  of  step  (i)  when 

h  =  #(f )  .  f  :  x  »(Y)  ,  i.e.  . 


s(#( f ) )  =  f  . 

In  summary,  a  certain  arbitrariness  marks  any  logical  system 
Even  Manes'  very  structured  system  with  many  useful  properties  - 
such  as  the  uniqueness  of  homomorphic  extensions  (see  his  Theorem 
5.7  [169])  -  still  depends  on  the  choice  of  lifting  operator  (in  his 
case  #)  and  <r  operation.  (The  imbedding  operation  is  usually 
naturally  determined.)  y 

As  Klement  mentions  in  his  paper  on  triangular  norms  [140], 
investiSations  (such  as  those  by  Zimmermann,  et  al .  [250], 
[291])  are  also  needed  for  better  semantic  modeling. 


Summary  of  extensions 


All  the  results  below  may  be  extended  in  the  obvious  way  to  an 
arbitrary  number  of  finite  factors,  by  replacing  r(X  x  x  )  by 

r(X1x...xXn)  . 

I.  f  :  Xj  x  Xj  -t  Y  ,  given 
We  always  have 

e(X1  x  X2)  :  Xx  x  X2  -  e(XJL  x  X2 )  S  x  X2 )  . 

Assuming  r  exists, 

r<Xl  x  X2)  :  sr(X1)  x  jr(X2)  -  «r(X1  x  X2 )  . 

Thus, 


e(Y) 

®  f 

:  Xx  x 

X2 

-  *<Y)  . 

(i) 

If 

# 

is  first 

available , 

define  n 

by 

tt(f ) 

§ 

#(e(Y)< 

»  f  ) 

-  *(f)  : 

* (X:  x  X2) 

-  7  (Y) 

In 

additi 

on. 

7  £ 

#(f 

)®r(x1 

X 

x2)  :  !T(X1 

}  x  y(x2)  - 

9"(Y) 

(ii 

)  If 

rr 

is  first 

available 

,  then  directly 

Tt(f  ) 

5-(X1  X 

X2 

)  -  T(Y)  , 

and 

again 

in 

turn 

7  =  tt(  f  )°r  (x 


1  X  X2)  :  ^ ( Xj )  x  T(X2) 


*(Y)  . 
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II . 


f  :  X:  x  X2 


(1)  If  * 
#(f)  : 


-*  r  (Y)  ,  given 

is  first  available, 
fT(X1  x  X2)  -4  9"  { Y ) 


then  directly 


and  hence  assuming  r  exists 

7  3  #(f)«r(X1  x  x2)  :  ^(X^  x  3-(X2)  7(Y)  . 


(ii)  If  Tt  is  first  available,  then 
TT  (  f )  :  9-{X1  x  X2)  -  3-  (9"  (Y)  ) 

and  in  turn,  letting  •  €  <r(X1  x  x2)  ,  ••  e  Y  , 

#(f)  3  ( (tt(  f )  )  ( *  )  )  (e(Y)  ( •  •  ))) 

:  9'(X1  x  X2)  -4  O’  ( Y)  . 


and  hence 


7  3  #(f)or(xi  x  x2) 


^(Xj)  x  sr(X2) 


tr(Y)  . 


In  particular,  all  of  these  results  could  be  applied  to  the 
classical  Boolean  polynomials  -  or  truth  table  function  operators, 
given  by  the  classes 

Booln  3  {f  |  f  :  { 0 , 1 } n  -  {0,1})  ;  n  =  1,2 _ 


Lastly,  consider  Manes'  concept  of  "distributions  as  opera¬ 
tions"  (see  [169],  Theorem  3.3). 

(1)  If  #  is  first  available.  Manes  shows  for  any  Y  e  SET  ,  the 
class  of  all  commutative  Y-ary  Z -operat t ons  satisfies: 

CY  ”  {7Y  1  *Y  :  SET  *<Ar(SET>)  »  where  for  all  X  e  SET  , 

3-y(X)  :  9"(X)Y  -  «r(X)  and  for  all  Z  e  SET  and  all 
P  :  X  sr  ( 2 )  , 

#(/3)og-y{X)  =  ?y(Z)°  (#(P)  )Y 

where  (#(/J))Y  :  9'(X)Y  -4  7(Z)Y,  where  for  all 
a  :  Y  -«  7  (X)  ,  and  y  €  Y, 

(#(/3)  )Y(a)  (y)  3  (#(p)oa)  (y)  }  , 

=  {f  |  f  :  SET  -4  ?  ( Ar  ( SET )  )  ,  where  f  €  7(Y)  and  for  all 
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X  e  SET  ,  ?{X)  :  3-(X)Y  -  !T(X)  where  for  all 
a  :  Y  -  T (X)  , 

( ? ( X) ) (a  )  £  ( #{a ) ) ( f )  , 

with  the  bijective  relation, 

f  — .  ?  ,  all  f  «  o-(Y)  , 
and  where  for  any  , 

3-y  =  {  (3"Y(  Y)  )  (  e  ( Y )  )  )"}  . 

(2)  If  the  less  restrictive  n  is  first  available,  analogous  to 
Manes'  proof,  we  can  show 

c  y  =  {try  |  !Ty  :  SET  -  9  ( Ar  ( SET )  )  ,  where  for  all  X  €  SET  , 

9-  (X)  :  XY  7(X)  and  for  all  2  e  SET  and  ail 

p  :  x  -*  Z  ,  =  7y(Z)°/JY  ,  Where 

/3Y  :  XY  -♦  ZY  ,  where  for  all  a  :  Y  -»  X  ,  and  all 
y  e  Y  ,  (pY(a)  )  (y)  £  (/5°a)  (y)  €  Z}  . 

=  (f  |  {  :  SET  -  9  (Ar (SET) }  ,  where  for  all  X  e  SET  , 
f  ( X )  :  XY  -»  tr(X)  where  for  all  a  :  Y  -*  X  , 

(  f  ( X )  )  (a  )  ^  (tt(a))(f)  ,  with  the  bijective  relation 

f  ♦-*  f  ,  all  f  e  Tv’Y)  ,  and  where  for  any  , 

*y  =  (  (3-y(Y)  )  (idy)  )~>  . 

Note  also, 

(?(Y) ) (idx)  =  f  . 

Thus  the  interesting  characterization  by  Manes  for  all  commu¬ 
tative  Y-ary  operations  does  not  in  general  extend  to  the  less 

restrictive  tt  system.  However,  by  specializing  Y  to  {1 . n}  , 

say,  the  second  result  shows  for  cartesian  n-products  there  is  a 
bijective  relation  between  commutative  n-ary  functions 

7y(X)  :  Xn  -*  9(X)  ,  X  arbitrary,  and  ?  for  all  f  e  T{l,...,n)  , 

where  for  any  x  e  Xn  considered  as  a  function  x  :  -*  X  , 

where  x(j)  =  Xj  ,  j  =  l,...,n  ;  x  =  (x1,...,xn)  , 

?(X)  (x)  =  (tt(x)  )  (f  )  . 
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Thus,  because  of  the  excessive  restriction  of  axiom  (M3),  many 
(but  not  all)  of  Manes '  elegant  results  do  not  hold  for  the  tt 
system  but  of  course  will  hold  when  #  is  replaced  by  tt  (which  is 
always  compatible,  as  shown  previously). 

7 . 3  Watanabe's  systea. 

Consider  now  in  some  detail  Watanabe's  fuzzy  set  theory  which 
also  has  a  strong  probabilistic  connection  [268]: 

Let  X  oe  an  fixed  nonvacuous  space  of  points  or  objects  and 
?1(X)  (corresponding  to  s(X))  a  space  of  attributes  or  predicates 

or  fuzzy  sets,  etc.  Let  <[>  :  X  x  7^(X)  -*  [0,1]  be  fixed  such  that 

for  any  x  €  X  and  A  e  ?1(X)  ,  <p(x,A)  =  <|>A(x)  €  [0,1]  .  Identify 

x  =  y  ,  iff  4>A(x)  =  4>a(y)  for  A  e  s  and  dually  A  =  B  iff 

cp A ( x )  =  ( x )  for  all  x  e  X  Suppose  also  0  ,  0  €  ^(X)  with 

the  evaluations:  <|>0  =  0  ,  $n  =  1  .  (For  convenience,  let  X  =  0  .) 

Let  p  :  X  x  9'1(X)  -♦  ?  ^  ( X )  be  fixed  and  define  composition  °  : 

» j  (X)  x  ^(X)  -»  ?1  (X)  by,  for  any  A,B  e  5(X)  ,  and  x  e  X  , 

^AoBU)  =  <t>A(p(x,B)  )-<t>B(x)  . 

°  is  not  associative  nor  symmetric  in  general. 

Let  the  simple  predicate  class  *(X)  be  defined  by 

y(X)  -  (A  |  A  €  S^X)  &  A°  A  =  A) 

=  (A  |  For  all  x  €  supp(A)  -  (x  |  $A(x)  >  0}  . 

$A(p(x,B))  =  1  ,  i.e.,  p ( x , B )  €  A). 

Thus,  0,X  €  y(X)  . 

(i)  If  A,B€  y(X)  and  A°p  =  B° A  ,  then 
A°B  =  B°A  e  S  (X)  . 

(ii)  If  A,B  e  y  (X)  with  A°B  =  B°A  =  A  ,  we  write  A  «  B  ,  an 
implication  or  deduction  sequent  between  A  and  B  ,  noting 
$B  >  pointwise  over  X  ,  if  A  y*  B  .  Also, 

AwA  ;  A  v*  B  ,  BmC  implies  A  C  ; 

A  =  B  iff  (A  «  B  &  B  «  A)  ; 

0  «  A  M  X  ,  for  all  A,B,C  €  y(X)  . 

(iii)  Thus  m  induces  a  partial  order  over  y(X)  ,  which  in  turn 
yields  least  upper  bound  and  greatest  lower  bound  operators 

u  ,n  :  f  (X)  x  f  (X)  -♦  f  (X) 

E.g.  ,  A ,  B  w  A  U  B  and  if  A ,  B  ►*  C  then  A  U  B  m  C  ,  which  expli- 
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citly  is  the  same  as,  for  any  A,B  «  ^(X), 

AU  B  =  C ( • • • »c (A)°C (B)°C (A)oQ (B) )  €  y(X) 

(unambiguously  well-defined) 

A  n  B  =  ( • • • oa°B°A®B)  «  y(X)  (unambiguous  well-defined) 

where  complemental  operator  C  :  *  -*  *  is  also  assumed  well- 

defined,  where  $  (x)  =  1  -  4>.(x)  ;  x  e  X  . 

C  A  A 

If  A°B  =  B°A  then  it  can  be  shown 
An  B  =  B  fl  A  =  A»B  =  B»A  . 

(iv)  It  also  follows  that  (y(X),  n,  u,  c)  is  a  complete  comple¬ 
mented  lattice  which  is  also  symmetric,  associative,  idempotent  and 
DeMorgan,  but  in  general,  not  distributive.  Also,  C  is  involutive 
with  the  Law  of  Excluded  Middle  holding: 

A  n  CB  =  0  ,  A  u  CA  =  X  ,  and  A®C(A)  =  C(A)°a  =  0  : 
the  Law  of  Self-Contradiction: 

A  «  CA  iff  A  =  0  . 

If  A°B  =  B°A  then  A® C ( B )  =  C(B)°A  and  C(A)»  C(B)  =  C(B)° 
C (A)  ,  and 

♦boa  +  Vc(A)  =  (pointwise)  ; 

An  C(B)  =  0  &  A«>  B  =  Bo  A  implies  A  w  B 
A  m  B  implies  A  n  C(B)  =  0  , 

A  ►*  B  iff  C(B)m  C(A)  (Modus  Tollens)  . 

(v)  Iff  for  all  A,B  e  :f(X)  ,  A°B  =  B°A  ,  *(X)  is  also  distribu¬ 
tive;  in  which  case ,  for  any  x  e  X  ,  $  ^ .  j (x)  :  y  (X)  -♦  (0,1]  is  a 

finitely  additive  probability  measure  over  f  .  Hence  in  the  lat¬ 
ter  case,  (<()j  #  j  (x)  )x6jc  is  a  family  of  probability  measures. 

(vi)  If  for  all  A , B  e  y(X)  ,  A°<B  =  BoA  and  y(X)  is  finite  in 
cardinality,  then  in  addition  to  the  results  of  (v)  ,  there  exist 
Aj'.t.'A  e  ^(X)  for  some  nil,  called  atoms,  such  that  w.r.t. 


n  and  u  ,  they  form  a  disjoint  partitioning  of  X  and  such  that 
for  any  A  €  y(X)  ,  if  A  »  A.  ,  then  either  A  =  0  or  A  =  A.  ; 

i  3 


j  =  1, . . . 

,  n  . 

This 

implies  that  any  A  e  y(X)  can  be  written 

uniquely 

A  = 

u 

AJ  ' 

’1 

fisjsn 

lAj  w  a 

J 

which  in 

turn 

implies 

for  any  x  e  X  ,  we  have  a  probability  func- 
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and  hence 

n 

<t>A(>0  =  *  :  a11  x  6  x  • 

Watanabe  realized  the  above  scheme  in  a  geometric  setting. 
However,  his  important  example  appears  somewhat  ambiguous  in  regard 
to  the  interpretation  of  the  composition  operation  and  whether  y ( X ) 
in  the  example  should  be  distributive  or  not.  We  present  a  simple 
unified  approach  here,  interpreting  composition  as  ordinary  matrix 
multiplication: 

Let  X  correspond  to  a  subclass  of  all  products  G  of  posi¬ 
tive  semidefinite  matrices  Gj  ,  where  G  is  normalized  with 

respect  to  the  trace  operation,  i.e.,  G  =  ( 1/tr (Gj* • G  ) ) • Gj • Gj • • G  , 

nil,  whence  tr(G)  =  1  .  (Thus  any  (l/trG^)G^  is  a  quantum 

mechanics  object.) 

Let  ?,{X)  correspond  to  some  subclass  of  all  A  or  I  -  A  , 
1  n 

where  A  =  A  ••A„  and  matrices  A.  are  positive  semidefinite  with 
l  m  j 

maxeig  (Aj)  i  l  .  Let  composition  °  be  ordinary  matrix  multipli¬ 
cation  so  that  /(X)  is  the  class  of  all  idempotent  (or  projection) 
matrices  intersected  with  ^1(X)  with  typical  element 

m  T  n 

C  =  Z  P.P.  ,  for  some  P,,...,P  e  R  orthonormal,  min.  It 
j=l  J  J  1  m 

then  follows  by  the  fundamental  commutativity  theorem,  for  any 
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m  m 

A  =  z  p-p-  <  B  =  z  P"P"T  €  y(x)  ,  letting 
3=1  J  J  j=l  J  J 

p'  -  <pi . p;->  * p"  -  <pi . p;..>  . 

AB  is  symmetric 

iff  AB  e  y (X) 

iff  AB  =  BA 

iff  P'  u  P"  s  {P^,...,p^}  orthcnormal  in  Rn  , 

in  which  case, 


*  T*  —  ~  n 


2  pipiT  € 

lsjsn  ) 


[p^eP‘np"J 

Define  <J>  for  any  G  e  X,  A  «  Jj(X)  ,by 
<t>A(G)  =  tr(A-G)  s  1  . 

Note  also  that  p  here  becomes 

p(G,B)  =  ( 1/tr (BG) - B- G  , 

for  all  G  «  X  ,  B  €  * 1 (X)  . 

Then  for  any  A , B  «  *(X)  ,  A  »  B  becomes  the  relation 
B£A,i.e.,w.l.o.g. 

A  =  *  PiPjT  '  B  =  Z  PiP1T  S  • 

J€/l1  J  •>  j€/l  ^  J  J  2  1 

Also*  A  n  B  and  A  u  B  have  interpretations  in  terms  of 
matrix  products.  In  particular,  if  AB  =  BA  ,  then  letting 

A  =  *A  "/j"  '  B  =  *  PJP3T  -o-  '•i  •  *  <1 . n,  . 

1  2 

A  H  B  =  AB  =  BA  =  ZPPT, 

J  j 
;,€/1lf1/12 


A  U  B  =  Z  PjPj' 


J€/ilUA2 

in  which  case  for  all  g  e  x  , 


4>a(g)  =  z  p,T-gp, 

A  J«/lj  j  J 


fn ( G )  =  Z  P.  GP.  , 

B  j«2  J  J 
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WSI  ■  1  PjTsPj  :  WG>  -  *  ' 

je/tjO/lj  J€/11U/12 

where  it  can  be  shown  that  the  same  bounds  ,  as  obtained  for  all 
copulas  and  their  Demorgan  t-conorm  transforms, hold  here. 

In  addition,  define 

CA  -  I  -  A  . 
n 

Note  that  if  A  =  z  P  .P  T  e  *(X)  ,  A  £  {1 . n}  , 

j€/i  J  J 

then 

CA  =  I  -  A  =  Z  P.P  .T  e  f  (X)  . 

n  J  J 

je  { l ,  .  .  .  ,n}-M 


Also,  for  any  A  e  s^X)  ,  G  «  x  , 


$ca(G)  +  <pA(G)  =  tr (AG  i-  (In  -  A) G )  =  tr (G)  =  1  , 

etc . 

Thus  all  desired  results  in  the  general  case  may  be  realized  in 
this  example,  which,  in  general,  yields  (y(X),  n,  u,  C)  as  a  non¬ 
distributive  system  (with  all  other  properties  for  the  general  case 
being  valid) .  Thus  the  system  will  become  distributive  (by  (v)  ) 
iff  for  all  A , B  e  y(X)  ,  AB  =  BA  ,  i.e.,  there  exists  a  fixed 

orthonormal  matrix  P  =  (Pj,...,P  )  such  that  all  A  e  f (X)  may  be 

T 

written  as  A  =  z  P .P .  for  some  A  s  (l,..,,n)  ,  etc. 

J  1  A 

j^A 

In  addition,  Watanabe  defines  an  extended  information  theory, 
where  for  any  G  =  G(P,W)  «  X  , 


n 


Ent(G)  =  -tr ( G* ( log  G) )  =  Ent(W)  =  Z  -  w.  log  w,  . 

j  =  l  J  J 


Watanabe  then  extends  the  following  basic  theroem  in  a  natural  way 
to  the  above  example:  The  entropy  of  a  joint  probability  function 
which  has  independent  marginals  is  the  sum  of  the  entropies  of  both 
and  exceeds  the  entropy  of  any  other  joint  probability  function  with 
the  same  specified  marginals. 

Finally,  Watanabe  proposes  that  in  the  pattern  recognition  or 
classification  problem,  one  may  re-interpret  the  membership  function 
within  the  same  basic  geometric  setting  as 
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class  or  pattern  against  which  observation  x  =  •  e  X  s  Rn  is 

x 

nj 

to  be  tested  for  membership.  In  practice,  Watanabe  concludes, 

rnk(A)  will  be  of  a  relatively  low  dimension  compared  to  both  n 
and  sample  sizes  used  to  empirically  establish  a  typical  candidate 
A  . 


7.4  Gainss 1  ancer  taint's 


We  discuss  here  a  simplified  but  more  general  variation  of 
Gaines'  uncertainty  logic  [66],  [68]  which  simultaneously  extends 

ordinary  probability  logic  and  Zadeh's  fuzzy  sets. 

Let  X  be  any  nonvacuous  set  of  predicates  and  ,  "ui "  h,- 

two  arbitrary  binary  operations  defined  on  X  which  are  symmetric 
and  associative  and  "not"  a  unary  operator  on  X  .  Let  3*(X)  be 
the  class  of  all  finite  well-formed  strings  from  X  under  these 
operations  with  distinguished  elements  T  and  F  e  »(X)  such  that 
for  all  p  e  »(X)  , 


(T  &  p)  =  p  =  <F  or  p) 

.(T  or  p)  =  T  ;  (F  &  p)  =  F  . 


(Note  the  similarity  to  the  usual  meaning  of  »(X)  :  the  power 

class  on  X  .  ) 

For  any  sequence  P1<P2,<--  «  »(X)  , 


pl&p2&' ‘ '  €  :  Pi  or  p2  or  ’ ’ '  €  *<x)  • 


Let  II  •  II  :  ?(X)  -»  [0,1]  be  a  truth  mapping  such  that 
(  1  )  II  (T)»  =  1  ,  II  (F)ll  =  0  ; 

For  any  p,q  e  9  (X)  » 

(2)  ii  (p  &  q) II  s  min  (iipii  ,Hqii  ) ) )  ; 

(3)  ii  (p  or  q)  li  2:  max  ( II  pli  ,  II  qll  )  ; 

(4)  n  •  ii  is  continuous  from  above  and  below,  i.e.,  for  any 
sequence  pa ,  p2 , • • •  €  »(x)  - 

"  (Pj  &  P2  &■••)»  =  lim  ii  (p:  &  p2  Pn)» 

n-«® 

ii  (Px  or  p2  or---)"  =  lim  »  (Pj  or  p2  or---or  pn)ii  • 

n-*°» 

Call  the  system  (?(X) ,  &,  or)  an  uncertainty  logic. 

Now  suppose  in  addition  to  (3>(X),  & ,  or)  being  an  uncertainty 
logic , 

(PI)  and  "or"  satisfy  in  addition  the  basic  relations  of 

mutual  distributivity  and 

li  •  it  is  a  valuation  mapping: 

For  all  p,q  e  ? ( (X)  , 

H  (p  or  q)  li  =  llpii  +  li  qi)  -  li  ( p  &  q)n 


(pin 
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and  the  negation  law  holds. 

(PHI)  1  =  ii  (p  or  not  ( p)  ii  =  ii  pu  +  u  (not  p )  it  . 

From  this,  we  can  show  for  any  p,q  €  ?(X):  the  Law  of 

Excluded  Middle  holds: 

ii  (p&  not  p )  ii  =  0  ,  since  1  =  n  (p  or  not  p)n  =  npn  +  n(not  p)n 

-  H  (p  &  not  p ) ii  =  1  -  ii  (p  &  not  p ) ii  , 

Hqii  =  H  (  ( p  &  not  p )  or  q)ii  =  it  ( p  or  q)&(not  p  or  q)H 

=  n((p  or  not  p)&  q )  II  =  n  (p  &  q)or(not  p  &  q))n 

=  il  (p  &  q)n  +  ii  (not  p  &  q )  ii  . 

Also , 

ii  (p  or  q )  Il  =  llpii  +  (iiqli-il(p  &  q)H)  =  H  pit  +  il(not  p  &  q )  II 
noting  ii(p  &(not  p  &  q)n  =  0  . 

If  ii  (p  &  q )  il  =  il  (p  or  q )  il  ,  then 

ii  ( p )  il  =  il  (p  &  q)ll  =  il  (p  or  q )  II  =  linn  . 

Also,  idempotency  holds 
II  pH  =  il  (p  &(p  or  not  p))n 

=  ii((p  &  p)  or  (p  &  not  p)  )  il  =  Il  (p  &  p )  n  . 

Also,  DeMorgan's  Laws  hold 

il  (not  p  or  not  q)U  =  n  (not  p)n  +  n(not  q)li  -  n(not  p  &  not  q)n 

=  ii  (not  p )  ii  =  il  (not  p  &  not  q)n  +  iinot  qii 

=  ii  (not  p  &  q ) li  +  iinot  qii 
*  (llqii  -  li  (p  &  q )  ll  )  +  li  not  qii 
=  1  -  II  ( p  &  q )  il 
=  ii  (not(p  &  q)  )il  , 

it  (not  p  &  not  q)n  =  iinot  pH  +  iinot  qii  -  ll(not  p  or  not  q)n 

=  1  -  llpll  +  1  -  llqii  -  il(not(p  &  q )  )  li 
=  1  -  (llpll  +  llqii  -  II  (p  &  q )  )  li  ) 

=  1  -  ll  (p  or  q)  ll 
=  ll  (not(p  or  q )  )  ll 

Also,  we  have  countable  additivity: 

For  any  p^p^...  <=  »(X)  with  li  (pi  &  Pj))n  =  0 

for  all  i  *  j  , 

ii  (PjOr  p2or- •  •  or- •  •  )  ii  =  limn  (p^or*  •  or  Pn)n 
i  £  n-*<»  “  ‘ 

n  ® 

=  lim  z  ii p  ii  =  z  up. ii  . 
n-®  j=l  :  j=l  3 

This  yields  a  probability  logic  (see  section  2.3.9  (A)  and 

Rescher  [214]),  where  it  should  be  noted  that  here  the  mapping 

ii  •  ii  :  *{X)  -•  [0,1]  is  not,  in  general,  a  truth  functional  in  the 

sense  that  there  is  no  function  such  as  <t>r  for  nor  <t>  for 

T&  Tor 

"or"  nor  cf>not  for  "not",  such  that  for  all  p,q  €  ?(X)  - 

ii  (p  &  q)u  =  4>&(  H  p»  ,  il  qii  )  , 
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as  is  true  for  many  general  logical  systems  -  including  general 
fuzzy  set  systems,  where  is  a  r-norm,  ,  a  t-conorm,  $not  > 

a  negation,  etc.  However,  do  not  confuse  this  with  the  probability 
algebra  system  9'  =  4>&.  $or)  =  (1  -  (•)-  prod,  bndsum) 

relative  to  discrete  spaces,  where,  in  effect,  the  predicates  p  « 
?(X)  are  replaced  by  probability  functions  over  X  . 

Thus,  e.g.,  suppose  we  let  ?{X)  be  (the  real  Borel  field 

on  R2)  with  &  =  n  ,  or  =  u  ,  not  =  C  ,T  =  R2  ,  F  =  so  and  il  •  n 
be  any  probability  measure  over  ®2  ,  then  we  do  have  a  truth 

functional  system  over  all  pairs  of  events  of  the  form 
d  d 

Px  =  (— ®,X]XR  ,  qy  =  Rx  (  -oo  ,  y  ] 

for  any  x,y  e  R  ,  since  by  Sklar's  theorem,  corresponding  to  II  •  n 
there  is  a  unique  copula  cf)&  such  that  for  all  x,y  €  ir 

ii  ( p  &  q  )  il  =  cb  r  ( il  p  il  ,  II  q  II  )  , 

My'  T&'  *x  ^y 

and  hence 


il  (p  or  q  )H  =  Up  il  +  il  q  il  -  i}>P(llp  n  ,  n  q  II) 
vix  Hy'  *x  ^y  *x  ^y  ' 

(although  in  general,  unless  <f>&  is  Frankian  in  form  -  see  2.3.6  - 

ii  (p  or  q  )n  is  not  a  legitimate  t-conorm)  and 
x  y 

»not  p  11  =  1  -  »p  11  . 

X  *x 

Particular  examples  of  choices  for  n  •  il  include  =  min, 

corresponding  to  the  two  marginal  random  variables  being  maximally 
(positively)  correlated,  =  prod,  which  Gaines  calls  the 

"stochastic  logic"  case,  corresponding  to  statistical  independence 
of  the  marginal  random  variables,  and  =  maxsum,  corresponding  to 

the  marginal  random  variables  being  most  negatively  correlated. 

However,  in  general,  for  the  same  probability  space  as  above, 
we  do  not  have  a  truth  functional  system  over  all  ^(X)  ,  since, 

e.g.,  take  p  to  be  any  fixed  subset  of  E2  such  that  (assuming 

one  exists)  il  pH  =  1/2  .  But  then  take  not  p  =  Cp  .  In  this  case, 
ii  C pH  =  1/2  also,  but  note 

H  (p  &  p )  II  =  1/2  >  0  =  II  (p  &  Cp)  il  . 

On  the  other  hand,  it  follows  that  any  general  fuzzy  set  sys¬ 
tem  7  =  (<t»not,  <|»s,  $or)  -  <}>not  a  negation,  <$>&  a  t-norm  and  $Qr 

a  t-conorm  -  over  ?(X)  is  also  an  uncertainty  logic  where  ii  •  H 
may  be  extended  unambiguously  to  satisfy  the  continuity  conditions, 
by  using  the  continuity  and  nondecreasing  properties  of  and 

Tor 

Gaines  presents  a  scheme  for  empirically  modeling  any  truth 
mapping  over  an  uncertainty  logic.  (Again,  we  modify  it  to  fit  our 
more  general  approach.): 
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Evaluate  empirically  first  X  by  the  usual  sample  percentage 
response  of  a  population  answering  affirmatively  to  the  question  "Is 
p  true  or  not?"  ,  for  each  p  e  X  .  This  corresponds  formallv  to 
the  estimation  of  unknown  parameter  n pn  for  a  zero-one  random  var¬ 
iable  Vp  .  For  a  probability  logic  approach,  interpret  "or", 

"not"  in  the  usual  senses  and  estimate  compound  predicate  truth 
evaluations  such  as  n(p  &  q ) n  by  those  who  replied  "yes"  to  both 
"p  true?"  and  "q  true?"  ,  while  for  li(p  or  q)H  ,  obtain  the  per¬ 
centage  who  reply  affirmatively  to  one  or  both  questions,  etc.  In 

this  case,  it  follows  that  the  estimator  n  •  i)  of  n  •  n  can  easily 
be  made  to  fit  the  required  probability  logic  structure.  Since  a 
general  fuzzy  set  system  is  truth  functional  we  need  only  obtain,  in 

addition  to  n  pn  ,  p  e  X  ,  as  above,  empirically  <$>„  ,  *  ,  * 

r  1  T&  Tor  Tnot 

by  discretizing  [0,1]  into  disjoint  cells  C.  =  [  —  ,—-]  ,  j  = 

3  mm 

0,1,..., m-1,  picking  any  two  cells  and  ,  obtaining  all 

p,q  e  X  for  which  n  pil  e  and  ilqii  e  C_.  ,  then  computing  for 

each  such  pair  p,q  ,  $&(p,q)  =  n(p,q)i!  as  obtained  in  the 

probability  logic  approach,  and  finally  compute  the  pooled  sum 

H  r  H  (  p ,  q )  il /card  (  C  .)•  card  (  C  . ))  where  card(C.  )  is  the  number 
^over  all  p,q\  J  K 

VI  pil  €  C  i  ,  II  qll  e  C  / 

of  p  e  x  which  have  ilpil  €  ,  k  =  i,j.  Analagous  computations 

should  be  carried  out  for  <b  and  <t>  .  . 

or  not 

Gaines'  original  version  Is  directed  to  specifically  gener¬ 
alizing  Lukasiewicz  f^-logic  and  Reseller's  probability  logic,  so 

that  he  also  considers,  e.g.,  the  Lukasiewicz  implication  opera¬ 
tor.  Although  Lukasiewicz  -logic  corresponds  to 

=  (1  -  (•),  min,  max)  ,  its  implication  operator  in  effect,  in 

the  form  ^(u.v)  =  <t>or(l  ~  u,v)  ,  actually  corresponds  to 

$  =  bndsum,  yielding 

0_^(u,v)  =  min  (1  -  u  +  v,l)  , 

whence  equivalence  is  given  by  <]>  ^  ( u ,  v )  =  1  -  |u-v|  ,  for  all 

u , v  e  [0,1]  . 

Figure  A  presents  a  simplified  view  of  the  general  processing 
flow  of  acquired  information  and  the  relative  degrees  of  generali- 
ity  of  some  of  the  more  important  logical  systems  discussed 
previously  which  can  treat  this  information. 


1  -  5  Schefe's  agreement  probabi 1 1 1 1 es . 


Schefe  [224]  presents  both  criticism  of  Zadeh's  fuzzy  logic  and 
an  attempt  at  replacing  fuzzy  set  theory  by  a  system  closely  aligned 
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to  probability  theory. 

Schefe  desires  to  connect  Zadeh's  basic  (first  order)  fuzzy  set 
theory  with  probabilities,  but  rejects  the  fuzzy  set  membership 
concept.  In  place  of  membership-grade  mapping,  equivalently,  he 
replaces  the  values  of  a  membership  function  in  the  unit  interval 
[0,1]  by  subsets  of  the  unit  interval.  We  simplify  and  extend  his 
concepts : 

Let  X  be  a  fixed  space  and  define  set-valued  map 
f  :  »(X)  -»  B  [  0 , 1  ]  ,  the  real  Borel  field  over  [0,1]  ,  generated, 

e.g.,  by  the  class  =  {[0,x]  |  0  S  x  s  1}  ,  where  f  is  bijec- 

tive  (thus  limiting  the  possible  elements  of  9  (X)  )  ,  9  (X)  is  the 

class  of  all  (compound)  predicates  of  interest  and  ?  (X)  £  9  (X)  is 

such  that  f  is  bijective  over  »Q(X)  relative  to  ,  where  for 

any  a  e  9  (X)  ,  a  is  typically  of  the  form  (y  e  A)  (A  to  be 

c 

defined),  y  e  X  with  f  (a)  =  f  ((y  e  A)  )  =  [0,x  j  €  s,  .  Thus 
[ 0 , 1 ] -Lebesgue  probability  space  ([0,1],  B[0,1],  vol^)  induces 

probability  space  (9 (X) ,  f  *(»  [0,1]),  P)  ,  with 


P  =  vol  °f 
1  o 

called  the  agreement  probability. 

In  addition,  using  fQ  ,  in  a  natural  way,  we  can  define  oper¬ 
ations  v,  a  ,  C,  »  over  9  (X)  For  any  . e  ^(X)  , 

n  ,  ,  n 

v  a  .  =  f  (  u  f  (a.)) 
j=l  J  °  j=l  °  j 

n  .  ,  n 

^,aj  ■  ■ 

Ca!  4  C‘C  VV 

!,1  ’  V  4  C"  W  U  W1  •  etr-- 


Hence  f  is  a  lattice  isomorphism  for  v  ,  a  ,  etc.  Also 


n  n 

P(  v  a . )  =  vol  (  u  f  (a  . )  ) 
j  =  l  J  j  =  l  3 

P(C  a l)  =  vol1(C  fQ(a1))  ,  etc. 


a  <=  9  (X)  ,  it  follows  that 

l  no 


In  particular,  for  all  a 
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n 

P (  v  a  . )  =  max  x 
j  =  l  3  lsjsn  aj 

n 

P (  a  a  . )  =  min  x 
j=l  3  lSjSn  aj 

P  ( C  a  )  =  1  -  x 

1  ai 

P(a,  =>  a.)  =  min(l,  1  -  x  +  x  )  ,  etc. 

12'  a  a2' 

On  the  other  hand,  for  a  ,  a2  e  9(X)  with  f  (a^)  and 
fQ(a2)  independent  events  w.r.t.  vol^  , 

P(ax  a  a2)  =  P(a, ) • P(a„) 

P(a2  v  a2)  =  P(aj)  +  P(a2)  -  P(a1>-P(a2)  . 


In  turn,  define 


a  {X)  =  {A  I  A  =  (a  )  „  ,  a  «  *  (X)  ,  y  «  X)  , 

o  1  y  yeX  y  o  1 

s  a(X)  =  (A  |  A  =  (ay)yeX  ,  ay  €  »(X)  ,  y  €  X}  . 


Any  A  e  -Aq(X)  is  called  an  elementary  agreement  probabilistic 

set,  wnile  A  €  a(X)  in  general  is  a  compound  agreement  probabilis¬ 
tic  set  with  membership-grade  mapping  r  ■  X  -*  (P(a  ))  v  with  the 

a  y  y€  a 

following  operations  defined  over  aq(X)  :  For  any 


A  ^y’yeX  '  B  ~  ^y’yeX 


A  U  B  =  ( a  vb) 

y  y  ye  x 


define 


whence  for  any  y  €  X  , 


rAUB(y)  = 


!  P  (  a  V  b  )  )  v 
y  y  ye  x 


and  similarly  for  A  n  B  ,  CA  ,  A  »  B  ,  etc. 

Hence  it  follows  immediately  that  aq(X)  and  7(X)  are  bijec- 

tive  under,  mapping  t{-  )  which  is  also  an  isomorphism  w.r.t.  n 
and  u  .  But  note  that  c  and  n  ,  e.g.  .  preserve  the  Law  of 
Excluded  Middle  unlike  fuzzy  set  theory  or  more  generally,  for  gen¬ 
eralized  logical  systems. 

For  any  A  e  A  (X5 , 

r  AH  C  A  =  0  =  (0)yeX  ' 
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Similarly,  other  classical  relations  no  longer  true  in  fuzzy  set 
theory  remain  true  for  agreement  probabilistic  sets  and  their  oper¬ 
ations.  In  addition,  Schefe  investigates,  to  some  degree,  possible 
definitions  for  conditional  agreement  probabilities. 

Schefe  is  unaware  of  any  of  the  more  direct  natural  relations 
between  fuzzy  sets  and  random  sets  as  set  membership  functions  as 
well  as  with  the  evaluation  of  random  variables  over  compound  events 
([90],  [116],  [86]  and  Chapters  3-6). 

Schefe 's  criticism  of  Zadeh's  fuzzy  logic  is  illuminating: 

(i)  He  points  out  that  the  subjective  interpretation  of  a  fuzzy 
set  membership  function  (e.g.,  [105]  or  [71]  or  Chapter  8)  ,  <p.[x) 

at  some  point  x  e  x  ,  is  the  degree  of  certainty  or  precision  that 
x  has  property  A  interpreted  as  non-fuzzy  by  Schefe  -  indeed  in 
terms  of  has  agreement  probabilities  and  membership-grade  mappings, 
rather  than  the  interpretation  that  x  has  A  or  x  is  in  A  to 
degree  $  ( x )  .  Furthermore,  he  claims  that  taking  1  -  (• )  as  a 

standard  negation  operation,  1  -  $^(x)  is  to  be  interpreted  as  "x 

e  A  is  (  1-4.  (x)  )-uncfirt?:n"  but  not  as  "(x  t  A)  is  (1  -  4>.(x);- 
A  ** 

certain”.  Hence,  according  to  Schefe,  we  must  obtain  additional 

evidence  concerning  x  not  having  property  A  ,  even  when  we  have 

the  certainty  level  for  x  having  A  .  In  this  regard,  Schefe 

examines  Shortliffe's  certainty  factors  approach  to  combining 

evidence  [236]  (or  [120])  and  exhibits  similarities  and  differences 

(such  as  inducing  the  ability  to  handle  conjunctions  of  independent 

events)  with  his  agreement  probability  model.  (See  also  section 

8.3.) 

(ii)  Difficulties  occur  in  modeling  fuzzy  set  membership 
functions,  including  these  due  to  statistical  and  cognition  uncer¬ 
tainties.  But  again,  Schefe  fails  to  see  that  the  same  problems 
arise  in  the  modeling  of  probability  distributions.  Suitable 
parameterization  of  the  family  of  membership  functions  being  con¬ 
sidered  can  be  a  reasonable  approach  to  this  proolem  analogous  to 
classical  statistical  modeling. 

(iii)  However,  Schefe's  criticism  that  the  intersection  of  fuzzy 
sets  (under  min)  produces  subnormal  (i.e.,  maximal  value  of  the 
membership  functions  being  less  than  unity)  fuzzy  sets,  which  may 
not  reflect  the  linguistic  situation,  is  not  on  firm  ground.  (See  a 
similar  type  argument  in  10.2  A  (I).)  Consider  Schefe’s  example  of 
"large"  a  "not  large"  =  "moderate  size".  This  should  really  be 
replaced  by  "not  small"  a  not  large"  =  "moderate  size"  where 
"large"  *  "not  small".  Indeed,  Schefe  himself  points  out  Hersh  and 
Carramazza's  empirical  results  [109],  indicating  that  "not  large"  is 
less  precise  than  "small".  In  the  second  definition  of  "moderate," 
the  following  more  compatible  result  may  well  abtain: 
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(iv)  Problems  In  modeling  hedges  (as  well  as  negations)  are 
indicated  by  Schefe.  Zadeh's  square  transform  ([15],  e.g.)  or,  more 
generally,  exponential  transform  on  fuzzy  set  membership  functions 
to  indicate  an  intensification  such  as  "very  (  ) "  as  indicated  in 
[109]  by  emperical  studies,  probably  would  be  better  modeled  by  a 
translation  operation.  (See  also  section  9  (C).)  In  a  related 
vein,  as  pointed  out  above,  the  antonym  "not  small"  *  "large"  and  a 
new  operation  are  required  ■“  o  captive  the  lack  of  precision  in  "not 
small"  as  compared  to  the  simple  negation  of  "large".  (See  [199] 
for  some  related  topics.  See  also  the  comments  in,  2.3.8  (B)  .) 

(v)  Finally,  Schefe  points  out  some  inadequacies  in  fuzzy  set 
models  involving  vague  inference,  including  one  treating  the  pre¬ 
mises  "  x  knows  y” ,  ”y  knows  z"  with  fuzzy  conclusion  "x  may  know 
Z  ,  too".  However,  if  more  information  in  the  conclusion  were 
desired,  a  more  specific  binary  relation  should  be  used  such  as 

(x,y,z)  e  X  =  set  of  all  people  who  work  at  A  and  "know"  replaced 
by  "know  well".  As  premises  become  more  descriptive  so  do  conclus¬ 
ions.  Schef e 1 s  appeal  to  simple  statistical  correlation  as  a 
measure  of  friendship  itself  is  inadequate,  since  a  multiple  attri¬ 
bute  approach  would  probably  be  more  appropriate:  "  x  knows  y"  can 
be  measured  an  terms  of  frequency  of  lunches  together,  visitations 
to  common  work  areas,  number  of  items  borrowed  from  each  other  over 
a  time  period,  reference  to  each  other's  names,  lengths  of  natural 
conversations,  etc. 

Additional  remarks . 

Lake  [149]  has  suggested  use  of  the  function  (in  place  of  set) 
-  axiom  approach  of  Von  Neumann,  which  would  encompass  ordinary  set 
theory,  fuzzy  set  theory  and  multiset  concepts. 

Birkhoff  in  Chapter  10  of  his  basic  book  [17]  presents  a  most 
succinct  account  of  attributes,  propositional  calculus,  quantum 
mechanics,  modal  and  multivalued  logic,  and  probability  theory. 


CHAPTER  8 


INFERENCE  USING  GENERAL  LOGICAL  SYSTEMS 


This  chapter  is  devoted  to  the  development  of  inference  proce¬ 
dures  useful  in  knowledge-based  systems. 


8 . I  Knowledge  representation 

The  problem  of  analyzing  and  representing  states  of  knowledge 
has  been  basically  treated  in  AI  systems  as  in  classical  probabi¬ 
listic  approaches  by  establishing  models  using: 

(1)  Conditional  (possibilistic  or  probabilistic)  distributions 
(or  dispersions)  representing  e.g.,  in  medical  diagnoses  problems, 
relationships  between  patients'  symptoms  and  disease  states,  and 

(2)  Conditional  distributions  representing  observed  data 
errors . 

Tribus  [252]  considers  probability  assignments  to  information 
available  as  a  numerical  encoding  of  states  of  knowledge.  For  the 
linguistic  information  analogue,  possibility  assignments,  such  as 
proposed  by  Zadeh  [290],  may  suffice  as  the  numerical  evaluation  of 
states  of  knowledge.  See  also  Felsen  [57’]. 

Due  to  the  concept  of  a  general  logical  system,  as  developed  in 
Chapter  2,  both  Zadeh ' s  possibilistic  and  the  more  classical  proba¬ 
bilistic,  approach  to  knowledge  representation  may  be  treated  in  a 
general  unified  manner. 

As  mentioned  at  the  end  of  the  Preface,  this  treatise  is  con¬ 
cerned  with  developing  appropriate  models  for  handling  different 
types  of  uncertainty  involved  in  knowledge-based  systems,  and  is  not 
about  the  "theory"  of  knowledge-based  systems  per  se.  However,  we 
will  summarize  here  some  of  the  relevant  issues  involved  in 
knowledge-based  systems. 

In  addition  to  the  reference  cited  in  the  preface  ( [  1 2  1  ]  )  ,  the 
interested  reader  should  consult  Bellman  [13‘],  Davis  and  Lenat 

[ 44^v] ,  and  Rich  [216* ],  where  knowledge-based  systems  are  viewed  as 
part  of  AI .  In  addition,  the  work  of  Spiegelhal ter  and  Knill-Jones 
[246],  where  comparisons  between  AI  and  statistical  techniques  are 
presented,  may  be  applied  to  the  analysis  of  knowledge-based  sys¬ 
tems  . 

Knowledge-based  systems  are  basically  systems  which  can  use 
human  knowledge  (sometimes,  expert  knowledge)  to  perform  tasks 
through  some  reasoning  process.  At  an  "intelligent"  level,  these 
systems  should  be  able  to  do  things  which  require  some  form  of  human 
intelligence,  e.g.,  decision-making  under  uncertainty. 

As  stated  earlier  (see  section  2.1),  natural  language  is  a 
priviledged  channel  of  ''ommunications  of  humans,  so  it  is  therefore 
reasonable,  ior  example,  to  consider  understanding  natural  language 
as  an  appropriate  (albeit,  ideal)  goal  for  machine  intelligence. 


475 


476 


Goodman  and  Nguyen 


This  requires  the  modeling  of  knowledge  acquisition,  knowledge  re¬ 
presentation,  and  knowledge  utilization.  (In  this  text,  emphasis  is 
mostly  upon  the  latter  two  areas.)  Often,  part  of  the  modeling  is 
based  upon  knowledge  gleaned  from  experts  -  leading  to  the  design  of 
"expert"  systems.  On  the  other  hand,  mathematical  logic  can  also  be 
used  in  the  modeling,  especially  in  the  representational  aspect. 
However,  as  Rich  [216']  points  out,  it  may  be  more  appropriate  to 
use  more  flexible  logics  than  simply  the  classical  one,  such  as 
Probability  Logic,  Fuzzy  Logic  or  some  "non-monotone"  logic.  For 
example,  in  the  case  of  incomplete  information,  a  type  of  "default" 
reasoning  is  suggested. 

At  this  state-of-the-art,  basic  objectives  in  the  field  of 
knowledge-based  systems  can  be  handled  by  declarative  methods 
(consisting  of  facts  and  procedures/rules  for  manipulating  facts)  or 
by  procedural  methods  (where  knowledge  is  represented  as  procedures/ 
rules  for  using  it).  Examples  of  declarative  representations  of 
knowledge  are  conceptual  dependency,  frames,  semantic  networks,  and 
scripts.  Default  reasoning  and  probabilistic  reasoning  can  be  con¬ 
sidered  as  procedural  methods.  Often,  a  combination  of  the  two 
above  methods  is  used.  Knowledge  representation  structures  can  be 
syntactically  oriented  -  where,  e.g.,  classical  predicate  calculus 
and  logic  are  utilized  -  or  can  be  more  semantically  oriented  - 
where  conceptual  dependency,  frames  and  semantic  networks  are  em¬ 
ployed,  etc.  In  a  related  vein,  Zadeh 1 s  PRUF  [283]  approach  should 
be  again  mentioned. 

Analysis  of  knowledge-based  systems  also  requires  the  modeling 
of  perception,  judgment,  integration  of  knowledge,  and  cognitive 
processes  in  general  [98’],  [25],  [38],  [157],  [274’].  Another 
related  area  Involved  in  knowledge  modeling  is  that  of  the  mental 
representation  of  concepts  and  concept  prototypes.  See,  e.g.,  Cohen 
and  Murphy  [40‘ ]  also  Osherson  and  Smith  [198’],  where  the  theory  of 
fuzzy  sets  has  been  considered.  Specifically,  a  concept  is  regarded 
as  a  definition  of  a  term  (in  some  natural  language)  or  a  statement 
of  the  defining  conditions  for  membership  in  the  class  designated  by 
the  term  .  A  theory  of  prototype  was  suggested  as  follows:  a  con¬ 
cept  C  (In  a  given  natural  language),  e.g.,  "bird",  "red",  "table", 
"red  table",  is  not  sharply  defined,  and  hence  should  be  considered 
as  a  fuzzy  set  of  some  universe  of  discourse,  say,  a  base  space  X  , 
elements  of  which  are  objects  related  to  C  .  The  notion  of  member¬ 
ship  in  C  is  specified  by  a  membership  function  $£  :  X  -»  [0,1]  . 

A  special  member  x  e  X  Is  chosen  based  upon  some  criterion  deter- 

o 

mined  by  a  given  metric  d  on  X  .  This  special  xq  is  called  the 
concept's  prototype.  Thus  a  mental  representation  of  C  is  a  quad¬ 
ruple  of  the  form  <X,  D,  x  ,  <)>_>  =  MR(C)  .  In  order  to  represent 

o  c 

more  complex  concepts,  in  natural  language,  (e.g.,  as  expressed  by 
adjective-noun,  noun-noun,  such  as  "ocean  drive",  "night  flight", 
etc.),  the  problem  of  conceptual  combination,  based  on  generalized 
set  theory,  has  been  discussed.  As  was  pointed  out  in  [198'],  there 
is  a  need  for  considering  different  logical  systems  other  than 
(Until,  ii&ii  ,  II  oril  )  =  ( 1  —  (  •  )  ,  min,  max)  .  Nevertheless,  the  negative 
conclusions  of  [198']  concerning  the  use  of  fuzzy  set  theory  due  to 
apparent  "fallacies"  is  basically  unfounded.  (See  section  10.2(A) 
for  further  discussions.) 
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8 . 2  A  decision  theory  based  on  dispersions. 

A  generalized  decision  theory  based  on  semantic  evaluations  of 
possible  losses  may  be  developed,  extending  the  classical  notions  of 
probabilistic  decision  theory  and  statistical  inference. 

Let  X  be  a  fixed  parameter  space  and  D  a  fixed  decision 
space  and  suppose  L  and  A  represent,  respectively,  loss  and  true 
values,  jointly  relative  to  X  x  D  .  Indeed,  assume  L  and  A  are 
both  generalized  subsets  of  X  x  D  ,  where 


4>l  ,  <t>A  :  X  x  D  -»  [0,1]  . 


In  turn,  expected  loss  in  ordinary  decision  theory  is  replaced  by 
the  semantic  evaluation  for  any  t  e  D  , 


E(4>(d|  .  }  )  =  ii  Ox)  (  (x,r  )  e  L  n  A)  )  n 
X€X 

=  ^or  (<^&(<^(L|y)  (r  ^  '  *(D|Y)(r)'  VY)  >  >  ' 
ye  Y 

assuming  the  Y-projection  may  be  consistently  carried  out  (see 
section  2.3.8  E),  where 

<fl(L|Y)(7’)  =  <t>or(<f’&(<J)L(x'r }  '  ^(C|y)(x,))  ' 
xeX 


where  it  is  assumed  for  all  reD.yey.xeX, 

( A | y ) | r ) (X)  S  ^(C |y) (X) 
and  :  Y  -*  [0,1]  is  given. 

* 

Thus,  the  Bayes  decision  function  $  will  satisfy 

(  D  |  •  ) 

E(<J)(D|-  =  E(<P  ( D  |  -  )}  ’ 

4>(D|  •  ) 

where  for  all  y  e  Y  , 


<P 


(D|y) 


:  T  )  =  « 


r  ,  r. 


(Krdnecker  delta), 


where  r is  determined  from 


'  <  L I  y ) ( r  y }  <f,(L|y)(r)  • 


For  example,  analagous  to  ordinary  decision  theory,  if 
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I*a(x,r)  = 

r  i 

if 

II  x-t  II  i  a 

• 

l  o 

if 

II  x-r  II  >  a 

then  in  a  limiting  sense, 

as 

a 

approaches  zero,  the  Bayes  decision 

function  which  yields  inf  E^(D|-}^  is 

*<D|-) 

»(D|y)'r>  *  V»(c|y)  ’  f°r  311 

For  simple  hypoth  s  testing 


L  ( x ,  t  )  =  a  > 

0  , 

iff 

x  s  Hj  r  = 

H 

o 

L  (  x ,  r  )  =  b  > 

0  , 

iff 

x  e  H  ,  r 
o 

-  Hi 

L(x,r )  =  0 

f 

iff  x  and  t 
otherwise . 

are 

where 

X  =  Hq  0  Hj  ,  D  =  {Ho,H1}  , 

the  Bayes  decision  function  is  equivalent  to  deciding 
/  ft 


V'*or  {*(C|y)(x))>  *  Vb'*or  <*<C|y)<x>»  ' 

(xeH1)  (x€Hq) 

noting  the  further  simplication  of  omitting  ,  a  ,  and  b  ,  when 
a  =  b  =  1  . 

Analogues  to  several  concepts  in  ordinary  statistical  theory 
may  be  developed  here,  including  maximum  likelihood  estimators, 
likelihood  comparisons,  invariant  estimators,  Neyman-Pearson-like 
hypotheses  tests  and  confidence  sets.  However,  these  will  differ  in 
form  -  in  some  cases  considerably  -  due  to  the  lack  of  the  con¬ 
straint  of  summing  to  one  for  the  possibility  functions  involved  and 

due  to  the  general  forms  for  $  +.  ,  <f>r  ,  d>  .It  should  be  noted, 

not  6c  or 

if  $no ^  =  1  -  ( • ) ,  $&  =  prod,  and  <f>Qr  =  bndsum  ,  and  the  possibi¬ 
lity  functions  representing  parameters  and  data  are  probability 
functions,  then  ail  of  the  above  decision  theory  concepts  reduce  to 
the  classical  probabilistic  counterparts. 

The  above  development  in  general,  is  in  contrast  to  other  at¬ 
tempts  to  "fuzzify"  ordinary  decision  theory,  by  replacing  ordinary 
sets  or  operations  involved  by  fuzzy  ones.  (See  Dubois  and  Prado 
[51],  pp.  277-296.) 

To  illustrate  the  above  remarks,  consider  the  following  example 
of  the  construction  of  a  possibilistic  confidence  set. 

p 

Let  unknown  parameter  9  €  x  =  D  =  R  ,  and  suppose  observed 
data  vector  y  €  y  =  Rm  ,  where 

y  =  f (9)  +  v  ;  all  9  e  X  ,  all  v  e  y 
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with 


where 


i  .  e .  , 


f  :  X  -♦  Y  known. 

9  :  Y  -*  X  estimating  function  being  h-invariant, 

h  :  Y  -»  X  known. 


9 (y)  =  9  +  h(v)  ;  all  9  e  x  ,  v  e  Y  . 

Let  v  have  possibility  function  not  dependent  on  e  . 

Define,  for  any  possible  9  e  X  ,  c  >  0  ,  region  R°  £  X  by 

B  ,  C 

Ro,c  =  {9  |  9  €  X  &  |  |  |  9  —9  HI  S  C)  . 

|  |  |  •  |  |  |  denoting  vector  norm  (while  ii  •  n  denotes  truth  value)  . 
Then,  for  all  9  €  x  , 

11(9  «  R  1 9)1  =  II  Or  (9  e  R  &  y  =  y)ii 

9  ( • ) .c  all  y  9 (y) ,c 

=  il  Or  (y  =  y)n  =  ii  Or  (v  =  v)  ii 

{ Y t  I  I  |e (Y)-fl I  I  I  s  c)  [i  |  | | \v | | |  s  c) 

=  4>or  (^(S))  =  ♦B(A)(c)  ’ 

<3|  1 1 13|  1 1  s  c> 

a  result  not  dependent  on  9  ,  assuming  the  truth  functionality  of 

evaluations  relative  to  <t>  .  Thus,  ,  :  IR+  -♦  [0,1]  is  mono- 

or  B ( A ) 

tone  increasing,  in  general. 

Hence  for  any  0  <  a  <  1  ,  defining  Ca  by 


R e(.)  q  Is  a  (1  -  a)-level  possibility  confidence  region  for  9  , 

analagous  to  the  classical  statistical  concept.  Especially  note, 
the  linear  regression  reduction  when  B  is  m  by  p  of  rank  p  , 

h(u>)  =  a-v  ,  where  a  =  ( Br  Cov”1  ( v  )  B )  _1Br  Cov- 1  ( u  )  ,  4>  =  bndsum  , 

or 

etc.  (See,  e.g.,  C.R.  Rao  [212].) 
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8.3  Combination  of  evidence  problem. 

Knowledge  representation  techniques  play  an  important  role  in 
the  design  of  expert  systems;  see,  e.g.,  Nau  [183]  and  Davis  and 
Lenat  [44'  ].  At  the  control  level,  the  problem  of  combining  of 
evidences  becomes  essential  in  the  technique  known  as  problem 
reduction,  e.g.,  in  MYCIN  (see  Shortliffe  and  Buchanan  [236])  where 
each  problem  can  be  decomposed  into  several  simpler  sub-problems. 

Formally,  Bayes'  theorem  for  manipulation  of  conditional 
probabilities  (in  statistical  systems)  provides  a  rigorous  proba¬ 
bilistic  analysis. 

Recall  that  the  probabilistic  combining  function  is  derived  as 
follows  (see  also  Ishizuka,  et  al . ,  [121]): 

Let  (fl,A,Pr)  be  a  probability  space  and  A,B,D  e  a  such  that 

Pr(A) ,  Pr(B),  Pr(D)  and  P ( D )  (where  D  is  the  set-complement  of 
D)  are  all  positive.  Then  we  have: 


Pr ( D  |  A  n  B)  = 


+  Pr ( D | A ) Pr ( D | B )  _  Pr(D) 
Pr ( D | A ) Pr ( D | B )  Pr ( 5 ) 


provided  Pr ( AHB | D )  =  Pr ( A | D ) Pr ( B | D )  and 
Pr (APB | D )  =  Pr( A | D) Pr ( B | D )  . 

Indeed , 

Pr  ( D  |  AnB )  -  Prlrr^r<DI 

_  Pr(A|D)Pr(B|D)Pr(D)  _  Pr ( D | A) Pr ( D | B )  Pr(A)Pr(BJ 
Pr (AHB)  Pr ( D )  '  Pr(AnB) 

But  Pr(APB)  =  Pr ( AHB | D ) Pr ( D )  +  Pr ( APB | D ) Pr ( D ) 


Pr ( D | A ) Pr ( D | B )  +  Pr ( D | A) Pr ( D | B ) 
Pr ( D )  Pr (D )  j 


Pr (A)Pr(B) 


Thus  : 


Pr ( A ) Pr ( B ) 
Pr(APB) 


[  Pr ( D 1  A ) Pr ( D | B )  +  Pr ( D | A ) Pr ( D | B )  ] 
i  Pr ( D )  Pr ( D )  J 


In  medicine  for  example,  due  to  the  lack  of  sufficient  data  and 
the  imperfectness  of  judgmental  knowledge,  one  is  led  to  consider  a 
more  realistic  measure  of  uncertainty  known  as  the  carta i nty  factor 
(Shortliffe  and  Buchanan,  [236])  which  is  an  approximation  of  condi¬ 
tional  probability.  The  problem  of  combining  of  certainty  factors, 
or  of  other  uncertainty  measures,  is  important  for  decision  process¬ 
es.  Dempster  [46],  followed  by  Shafer  [228],  generalized  classical 
Bayesian  analysis.  Dempster  in  essence  worked  with  multi-valued 
mappings  (or  random  sets),  while  Shafer  presented  an  axiomized 
version  of  the  approach  in  the  form  of  his  "belief  functions".  Both 
addressed  the  problem  of  combination  of  evidence  based  upon 
Dempster's  earlier  work,  Dempster  [46].  The  Dempster-Shaf er  theory 
of  belief  functions,  which  claims  to  provide  a  mechanism  for 
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handling  doubt,  ignorance  and  conflicting  evidence,  is  receiving 
increasing  attention  in  the  AI  literature.  (See  Spielgelhater  and 
Knill-Jones  [246].) 

However,  it  should  be  emphasized  that  Dempster-Shaf er  theory  is 
completely  analyzable  through  the  use  of  random  sets  and  the  Choquet 
capacity  theorem  (see  section  3.2  or  Nguyen  [187],  etc.).  For 
example,  the  Dempster-Shaf er  rule  of  combination  of  evidence  is  in 
actuality  the  superset  coverage  function  of  the  intersection  of 
statistically  independent  random  sets  (see  Goodman's  comments  [90] 
or  the  remark  following  Theorem  5,  section  4.2). 

Let  us  return  to  the  concept  of  certainty  factors. 

Let  Pr)  be  a  probability  space.  For  an  event  H  e  a  , 
considered  as  a  hypothesis,  and  an  event  E  e  a  ,  considered  as  an 
evidence,  the  'certainty  factor  for  H  in  the  light  of  E'  is 
defined,  assuming  0  <  Pr(H)  <  1  , 


Pr (H| E) 

-  Pr(H) 

,  if  Pr ( H | E ) 

>  Fr(H) 

1  - 

Pr  ( H ) 

Pr ( H | E) 

-  Pr(H) 

,  if  Pr ( H | E ) 

<  Pr ( H ) 

Pr  ( H ) 


Remark : 


The  set  function  a  :  a  ff?+,  where 

a(A)  ~  —  Pprtl)  :  A  e  •*  •  Pr(A>  *  1  - 

is  the  (prior)  odds  ratio  of  the  event  A  .  The  conditional  odds 
ratio  of  A  given  B  is 


aB(A) 


Pr  ( A  |  B  ) 

1  -  Pr (A)  ' 


Note  also  that,  when  Pr(H|E)  2  Pr(H)  , 


CF ( H , E )  =  a£ ( H )  -  a(H)  . 


Heuristically ,  the  same  interpretation  of  CF  applies  to  the 
case  Pr(H|E)  <  Pr(H)  with  appropriate  odds  ratios. 

The  set-function  a  satisfies  the  following  properties: 

( i )  a (A) -a ( CA)  =  1  ,  V  A  €  A  , 

(ii)  A, Be  A  ,  AS  Bs  a( A)  S  a  (B) 

( iii )  A,B  «  1  ,  A  n  B  =  03  a(AUB)  2  n (A)  +  a< B) 
(superadditive) , 

(iv)  If  A  ,  B  e  a  and  A  T  (Resp.  B  1)  ,  then 

n  n  n  n 

n 

at  ( A^ )  t  a(UAn)  (resp.  a  ( Bn )  A  a(OAn)  (sequential 
ascending  and  descending  continuity) . 


i 
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We  can  extend  a  from  a  to  9  (h)  by  replacing  Pr  in  the 
definition  of  a  by  its  induced  outer  measure.  If  fi  is  a  topo¬ 
logical  space  and  A  is  its  Borel  o-algebra,  then  it  can  be  veri¬ 
fied  that  a (or  a  )  is  a  Choquat  capacity. 

hi 

Now,  from  the  definition  of  CF(H,E),  the  problem  of  combina¬ 
tion  of  evidence  -  namely,  given  C(H,E^)  ,  i  =  1,2,  say,  find 

CF(H,E1  and  E 2)  -  is  centered  around  the  following  problem  of 


calculus  of  probabilities:  given  Pr(H|E^)  ,  i  =  1,2,  find 
Pr(H|E1  n  E2)  . 

Assuming  independency  between  evidences,  i.e., 

PrfE^  n  E 2 ] H )  =  Pr ( E  | H ) Pr ( E2 | H)  ,  an  ad  hoc  combining  function  was 

proposed  in  MYCIN  [?36]  in  the  following  "truth  functional"  form: 


where 


g(x,y)  = 


CF  ( H 

'E1  n  E2>  = 

g[CF(H, 

V 

,  CF(H,E2)] 

1 

if 

X 

=  1 

or  y  =  1 

d 

X 

+  y  -  xy 

if 

X 

>  0 

and  y  >  0 

X 

+  Y 

if 

X 

*  i 

1  ,  y  *  ±  1 

X 

+  y  +  xy 

if 

X 

<  0 

and  y  <  0 

.  - 

1 

if 

X 

=  -1 

or  y  =  -1 

and  xy  <  0 


As  it  was  pointed  out  in  Ishizuka,  et  al .  [120],  CF(H,E1  n  E2)_, 

which  is  obtained  by  using  the  above  combining  function,  is  only  an 
approximation  since 


g[CF(H,E1) 


CF(H,E2)]  * 


Pr(H|E1  n  e2)  -  Pr ( H ) 

-  -  pr(Hj 


A  consistent  combining  function,  always  holding  in  the  case  of 
independent  evidences,  was  proposed  in  Ishizuka,  et  al . ,  [120].  The 

combining  function  turns  out  to  be  a  function  of  CF(H,E^),  i  =  1,2  , 

and  of  the  prior  odds  ratio  a(H)  .  See  also  Ishizuka,  et  al  .  , 
[121]  for  the  problem  of  combination  of  evidence  using  Dempster  and 
Shafer's  theory. 

In  summary  it  has  been  proposed  to  combine  evidence  in  the 
forms  of: 

(1)  conditional  probabilities  using  the  classical  Bayes' 
theorem , 

(2)  belief  (and  related)  measures  using  Dempster-Shaf er ' s  rule 
of  combination  evidence, 

(3)  certainty  factors  (as  approximating  to  conditional  proba¬ 
bilities)  using  ad  hoc  procedures. 

Typically,  in  applying  Dempster-Shaf er  theory  to  problems  of 
combining  ev)^°n^»  nr  in  estimating  unknown  parameters,  the  relevant 
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belief  measures,  doubt  measures,  plausibility  measures,  etc.,  must 
be  known  exactly  or  up  to  some  class.  (See,  e.g.,  Shafer  [228].) 
However,  the  results  of  section  3.2  show  that  there  is,  up  to  dis¬ 
tributional  considerations,  bijective  relations  between  such 
uncertainty  measures  and  corresponding  random  sets,  and  as  well, 
between  belief  and  doubt  measures,  doubt  and  plausibility  measures, 
etc.  In  practice,  it  may  be  too  difficult  to  obtain  the  necessary 
probability  measures  corresponding  to  the  random  sets  of  interest. 
This  is  often  due  to  combinatorial  considerations:  for  a  space  X  , 

2  '  '  =  Card  ?(X)  probability  values  are  involved  for  S  ,  any 

random  subset  of  X  .  A  reasonable  weakening  of  the  above  require¬ 
ments  is  a  specification  of  only  the  k-coverage  functions  of  the 
random  sets  involved  for  some  suitable  finite  k  .  In  particular, 
k  =  1  yields  the  weakest  level  of  specifying  information,  k  =  2  , 
less  weak,  etc.  See  sections  5.2,  5.3  for  further  details  concern¬ 
ing  the  spectrum  of  specification  of  random  sets  through  their 
multiple  point  coverage  functions.  Also,  it  has  been  shown  (section 
5.2)  that  the  k-point  coverage  determination  of  random  sets  is 

essentially  equivalent  to  the  specification  of  a  corresponding 

multiple  membership  function,  i.e.,  the  semantic  evaluation  of 
the  distinguished  k-place  signature  membership  predicate  symbol 
e  .  .  between  k  variables  and  k-place  attributes  of  gen- 

1 1X  "  X  k 

eralized  sets.  In  particular,  the  case  k  =  1  corresponds  to  the 
simple  membership  symbol  e  ^  (or  just  €)  .  (See  section  2.3.5.) 

With  the  above  in  mind,  we  can  consider  for  purposes  of  sim¬ 
plicity  only  the  case  k  =  1  and  develop  an  approach  to  the 

combination  of  evidence  through  simple  dispersions  representing  the 

semantic  evaluation  of  the  relevant  membership  relations  between  the 
unknown  parameters  of  interest  and  their  attributes. 

Finally,  it  should  be  noted  that  this  approach  is  a  general¬ 
ization  of  the  case  involving  conditional  probabilities,  when 
Probability  Logic  is  used  for  the  semantic  evaluations. 

The  following  approach  to  the  combination  of  evidence  within  a 
general  multi-valued  logical  set  theory  conti xt  may  be  modified  in  a 
straightforward  way  for  changes  in  the  assumptions.  Here  it  is 
assumed  that : 


(a)  A  collection  of  joint  posterior  membership  functir.s 

°k 

is  available: 


<fc  ^  k '  zk 
k 


V 


V 


where  C,  e  7  ( X  x 
k 

unknown  parameter  vector 
parameter;  and  y,^  e  y^ 
the  collection  index  set 


;  k  e  J  , 

0^  <=  X  represents  the  common  values  of 
0  e  X  ,  z^  e  represents  a  nuisance 

represents  data,  all  for  source  k  e  J  , 
of  sources.  J  may  be  finite  or  infinite. 


(b)  A  fuzzy  set  system  ^nt'^&’^or^  is  chosen  for  the 

logical  operators.  More  generally,  a  single  logical  system  or  a 
collection  of  logical  systems  cculd  be  chosen,  but  for  simplicity, 
only  the  former  will  be  treated.  For  justification  for  choosing  a 
particular  fuzzy  set  system  (or  systems),  see  Chapter  9. 

Then,  assuming  no  interactions  from  the  overall  joint  posterior 
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function, 


$  (£,z|y)  =  "(£,z)  €  x  c.  |  yii 

o  ke  J 

=  (ek'zk>Yk))  • 

kej 

(=  4>  c  (£ -zly) )  - 


where 

~  =  (ekJk€j  '  ~  =  ^k’keJ  :  y  =  (yk}k«J  ' 

In  turn,  the  nuisance  parameters  may  be  eliminated  by  the 
projection 

$  (£  |y )  =  ii  Or  ($c  (£  ,z  |y )  )  it 

.  .  zeD  o 

pro j  (C  )  ~  o 

X 


*QP  (4>r  (£.z|y))  - 


Tor  ,TC 

zeD 
~  o 


where 


D  =  x  D. 
°  ke  J  * 


The  unnormalized  diagonal  set  dlag  e  ?(X)  is  determined  by  the 
constraint 

♦diag^^1  =  (Voj  (C  )<Sly>>  '  Sk  =  0  '  k  €  J  :  a11  8  e  X  • 

X  o 

Let  univ  e  ?(?(X))  be  a  universal  set  representing  all 
possible  (fuzzy)  sets  describing  8  .  Then  at  diag  , 

$  .  (diag)  =  o  ( <±> , .  (e|y))  . 

Tuniv'  Tor'Tdiag'  '■* 

9  €  X 

Noting  the  identification 


-univ<e'dia9ly’  =  ^diag^8 


rdiag-univ 


the  desired  conditional  possibility  function  $(8|diag,  y)  is 
obtained  from  the  relation 

^diag'01^5  =  Idiag,y)  .  *univ ( diag ) )  . 


In  particular,  suppose  :  X  =  IR^,  J  finite,  =  prod, 
<PQr  ~  probsum,  Dq  =  0  , 
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*(flk  I  yk  :  k)  =  vy9k  -  Pk(yk).  Ck)-£(9k) 

where  is  the  j-dimensional  multivariate  Gaussian  p.d.f., 

Pk(Yk)  is  an  estimator  of  the  true  value  for  0k  <  ck  is  a 
corresponding  covariance  matrix  of  error,  and  A^(g^)  is  the 
X-volume  of  a  sufficiently  small  X-cube  (cell)  Ck(0k)  surrounding 
0k  €  X  ,  k  €  J  . 

It  then  follows  that  (approximately,  becoming  more  accurate  as 
Jk(ek}  "*  0  '  k  €  J) 

§(0  |  diag ,y ) 

=  y(9  -  JJ(  Y)  ,  c)-^k(ek)  , 


where 


p(y)  =  C-  i  (C"1-**.  (y.  )  )  , 
ke  J  K  *  K 

c  =  r  c~1j~1  . 
ikej  k 

Note  that  (p(y),Cj)  represents  the  standard  BLUE  (best  linear 

unbiased  estimator)  statistical  estimator  of  e  . 

Consider  again  the  p  's  in  general  where 

Ck 

*C  (®k-  zk  I  Yk)  ;  all  0k  €  x,  zk  €  2k  ,  k  e  J  ,  with  all  of  the 
above  definitions  holding. 

The  following  theorems  are  direct  consequences  of  these 
definitions  (see  Goodman  [86]): 


Theorem  1 


If  J  =  (1,2,3 _ )  , 

0  =  (0  |  0  e  X  &  lim  <f>  (e,  2fc  |  yk)  =  1}  , 

k-*-H=  K 

for  all  z  «  D  ,  and  if  4>P  is  an  Archimedean  t-norm  (see  section 
o  & 

2.3.6),  the** 

^diag(®  l*J  =  0  ;  a11  9  €  x  • 

Remark  1 . 

If  cf>&  =  min,  then  in  general 
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4>diag(9 l^)  >  0  ;  a11  9  fe  x  • 

Recall  from  section  (5.2)  the  bijective  relation  between  flou 
classes  (or  collections  of  nested  level  sets)  and  membership  func¬ 
tions.  Thus 


ncoly|  a  U)«IOil] 


corresponds  uniquely  to 

o 

Define  also  for  any  t-norm  the  flou  class  of  conjunctive 

information  as 


MIC,  <f&|y) 


Then 


.kej  °k 


ae [0, 1] 


Theorem  2 


r(CQ|y)  s  M(C,<t>&|X)  , 

for  all  y  ,  with  equality  holding  iff  <J>&  =  min  . 


Remark.  2 


In  a  sense.  Theorem  2  indicates  no  information  loss  when 
<f>&  =  min  is  chosen.  (See  Goodman  [86],  [82],  [91*  1  *  ]  for  related 

resul ts  .  ) 


8 . 4  Some  asymptotic  results 

Most  results  involving  general  logical  systems  and  fuzzy 
systems  are  concerned  with  fixed  finite  structure?. 

In  response  to  the  criticism  of  Arbib  [6]  and  Manes  [170], 
[168]  concerning  the  lack  of  asymptotic  results,  Dishkant  [48]  has 
made  progress  in  addressing  this  issue.  He  shows  for  general  fuzzy 

set  system  7"  ^  ( 1  — ( •  )  ,  minsum,  max)  ,  for  any  sequence 

(Aj)j  =  12  °f  fuzzy  subsets  of  ir  with  sufficiently  well- 

behaved  derivatives  for  ,  which  have  only  one  modal  point 

Aj 

yielding  value  one  and  having  finite  interval  support, 

(  *IA  (x)  -  p  (x)  ) j ,  .  .  .  -  0  ' 
j<n) 

uniformly  in  x  ,  for  all  x  «  (R  ,  where 

p(x)  =  max  (1  -  (c/2)(x  -  a)2,0) 
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~  exp  (-(c/2) (x  -  a)  )  ; 

for  constants  c  >  0  and  a  e  ir  which  functionally  depend  on  the 
"well-behaved"  conditions  of  the  derivatives.  In  addition,  Dishkant 
discusses  estimation  of  a  and  c  from  empirical  evidence. 

Concerning  Manes'  last  criticism,  see  section  2.3.7  where  two 
justifications  for  the  extension  principle  -  in  its  general  form  - 
are  given. 

It  also  can  be  shown  there  are  general  fuzzy  set  system 
analogues  of  the  Law  of  Large  Numbers  and  related  results.  Speci¬ 
fically,  the  following  properties  may  be  easily  obtained: 

(1)  Following  the  extension  principle,  justified  by  multivalued 
logical  considerations,  let  A^,  be  arbitrary  fuzzy  subsets 

of  X  S  ir  ,  where  X  is  a  discrete  (finite  or  countable)  space. 

Then,  defining  the  function  h  :  iRn  -*  IR  ; 

n 

d  1  n 

h  (x  ,  .  .  .  ,  x  )  =  -  2  x.  ,  all  x,  ,  .  .  .  ,  x  e  ir  , 

n  1  n  n  l  In 

the  mean  or  least  squares  estimator,  etc.,  without  any  appeal  to 
statistical  theory.  Then  for  any  y  e  ir  , 


\(A1 . An)<YJ  =  “<Y  «  W 


.An))" 


"  ♦or  V*Al<xl>* 


(x1#  .  .  .xn)  «  h^fy)  n 


•  *A  (V> 


For  any  «  >  0  ,  consider: 


( |y-pnl*a) 


( a . A nj<y) 

n  l  n 


=  I  Or  ( | y-w  |  k  a  &  y  €  h  (A  . A  ) ) |  ,  for  all  y  e  X  ,  (1 

xe  X 


where 


■  V 


d  1  _ 

=  -  T  M  ,  , 

n  n  j=l  j 


1  ’  r  2  ’ 


a  sequence  of  constants  in  R  .  Note  that  the  number 


of  arguments  of  $  is  at  most  countable  here  (while  that  of  $ 

or  & 

is  finite). 


Thus,  if,  e.g.,  are  some  measures  of  central 

terdency  such  as,  e.g.,  means 
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P,  -  r  x.$  ,  (x)  , 

J  X€X  j 


where  for  ill  x  e  R  , 


(>A,  (x)  =  d/^  (x)  * 

3  J 


and 


•»  . 

1 


d 


z  t 

X€X 


A  . 
1 


(X) 


is  assumed  finite;  j  =  l,2,...,n  , 

then  eq.(l)  represents  the  general  fuzzy  set  extension  of  the 
probability  form 


Pr(  |Xn->V  *  €  >  ' 


where 


x„ a  Vxi . V 


random  variable  X.  having  as  its  probability  function, 

2  Aj 

j  =  1,...,^  with  X, . Xn  being  statistically  independent.  As 

before,  merely  specialize 

( <|>nt  >  <or)  =  (l-(*)»  prod,  bndsum)  (bndsum  =  sum  over  proba¬ 

bility  functions)  to  obtain  the  probability  analogues. 

Assume  now 


(i) 

V 

•»„,  ...,5-r  <+<», 

2  O 

(ii) 

v2  ■  ■  •  • '  <  +  00  ' 

(iii) 

2 
o  . 

J 

=  i  (x-fJ.)2  <p  ,  (x)  <  a2  <  +  ®  , 
X€X  2 

,  j  =  1,2, 

(iv) 

is  any  t-norm  bounded  above  by 

prod  , 

(v) 

Tor 

is  any  copula. 

Then  for  any  8  >  0  , 
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X1 ' ' ‘ ’ ,xn)€  hn1 (y5nx  1 

lY-^nlia'Y€X 


(,V(Xl)',<|,An(xn,)  * 


cb  s  bndsum  s  sum 
Tor 


2  _  ^A1  {xi: 

(x1 . xn)€  h”1  (y)nxi  1 

|y-Pnl*a<  veX  J 


*A'  (xn> 


Pr( iXn  ‘  V  “  5) 


noting  that  now  Xn  is  the  sample  mean  of  statistically  independent 

random  variables,  which  by  use  of  Chebyshev's  inequality  yields  the 
standard  upper  bound  for  Weak  Law  of  Large  Number  convergence. 
(See,  e.g.,  Revesz  [216].) 

Note  that 

(a)  If  all  ij,  ij,  ...  <  i  , 

i.e.,  <t>.  ,  <t>.  ,...  all  represent  deficient  probability  functions, 

1  A2 

then 

0  =  lim  Jn , a  S  °(7on/n)  ' 
n-»® 

better  than  exponential  rate  convergence  to  zero. 

(b)  If  all  if  a  =  1 2  =  •  •  •  =  l  , 

i.e.,  <}>fl  ,  <}>  ,...,  all  represent  ordinary  probability  functions, 

A1  A2 

then 


0  =  lim  J  .SO-), 
n ,  c  n 

n-»® 


(c)  However,  if 


71'  7  2  '  2  1  ' 


then  the  upper  bound  on  Q  diverges, although  for  any  fixed  n 
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by  choosing  a  >  0  suitably  large  Jn  5  may  be  made  small. 

(2)  The  last  situation  can  be  improved  upon  by  considering  first 
the  continuous  analogue  of  (1)  and  within  it  the  class  of 
Gaussian-like  fuzzy  set  membership  functions: 

Suppose  t-norm  <|>&  is  arbitrary  differentiable  in  all  of  its 

arguments  at  0  and 

<K  .  <P,  -  :  R  -  [0,1]  , 

1  A2 


with  s  iRn  ,  an  open  connected  domain. 

Also,  for  each  m  k  1  ,  define  for  any  >  0  ,  k  =  1 

n  >  1  , 


v„ 4  «  1  1  -  °' 1  1 =■•  •••> 


and 


K  s  b  n  M 
m,n  n  m,n 


,  n ; 


Suppose  the  grid 

lim  max  (A  ^ , . . . )  =  0 
m-»a> 

for  any  mil.  Finally,  suppose  ()>or  is  any  Archimedean  copula 
(see  2.3,6)  which  implies  there  is  a  decreasing  convex  h 


h  : 

[0,1] 

-♦  R  + 

;  h  ( 1 )  =  0  , 

such  that  for 

all 

Yr  ... 

■  ,Yn  *  [0,1]  , 

n 

<J>  i 
Tor 

(Y1(.. 

•'V 

=  1  -  h  (min(h( 0) , 

z 

j  =  l 

Ml  -  Yj  )  )  ) 

assuming  also 

h(l 

-  U) 

is  di f f erentable  in 

u 

at  o  . 

Then  for 

any 

n  t  1 

,  defining 

Tor 

x(n)€B 


(VV(xi)dxi . {Xn)dXn}) 

1  n 


1  im 


<P 


. *A  ‘VV”1’' 

x(n)€K  1  n 

m ,  n 


=  1  -  h_1  (mint  h  (  0 )  ,  -*■ 1  ]  •  Nn)  )  <  Nn  . 
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by  suitable  multiplying  and  dividing  by  arguments,  use  of  the  defin¬ 
ition  of  integration,  and  eq.(2)  ,  where 


d  f*  VY1 . V' 

n  "  *yi . *yn  Jy1=--=yn=o 


J  *A  <V*A  (xn,dVdxn  ■ 


x(n)eB 


If  now  for  j  =  1,2,... 

(i)  7^  =  J  ^  (x)dx  s  <  +  ®  , 

X€R  ^ 

defining  probability  density  function  $A,  by 

j 

<K,  (*)  -  T~  (X)  ;  311  x  e  R  ' 

j  j  j 

(ii)  fj  =  |  x<}>A ,  (x) dx  <  +®  , 

x«R  ^ 

(iii)  fl  J  (x-fj  )  2$k,  (x)dx  <  <  +«  , 

J  J  j 

X€R  J 

(iv)  <J>&  is  any  t-norm  bounded  above  by  prod  , 

(v)  4>or  is  any  Archimedean  copula  , 

(vi)  Bn  =  {(x1,...,xn)  |  |xn  -  Pn |  2:  8}  ,  for  all  nil, 

then  for  any  nil  and  any  8  >  0  ,  by  a  slight  modification  of 
the  last  result. 


Jn,8  =  *or 


x(n)«B. 


(VV(Xl)dXl .  *A  (xn)dxn)} 

1  n 


£  i  ■  Pr ( IX  -  v  |  i  a )  , 
o  '  1  n  n 1 


where  now  random  variables  Xj,...,X  correspond  to  p.d.f.'s 


<f>A . tA,  ,  respectively, 

1  n 


o 

n  o 


Again,  the  same  convergence  conditions  hold  analogous  to  the 
discrete  case.  However,  in  this  case,  if  these  conditions  can  be 
improved  upon  by  suitably  restricting  the  shapes  of 

<?>.  then  convergence  can  take  place.  That  is,  geo- 

A1  2 

metric  convergence  in  n  can  be  assured  for  Pr( -  /7  |  i  8)  , 

by  the  Baum-Katz-Read  Theorem  [13],  which  gives  necessary  and 
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sufficient  conditions  for  such  convergence:  essentially,  the  moment 

X  t 

generating  function  of  Xn  ,  E(e  )  as  a  function  of  t  for  any 
a  >  0  ,  and  t  sufficiently  small  satisfies 
X  t 

E(e  n  )  £  const, • e ^ ^ 6  . 

O 

This  condition  is  satisfied  by  the  Gaussian-like  forms  (or  by  suit¬ 
ably  bounded  mixtures  of  them) 

d  -a  .*( x-fj  . )  2 

$  (x)  =  c .e  J  J  ;  all  x  e  R  , 

Aj  J 

where  a^  >  0  is  arbitrary  fixed  and  where  1  >  c ^  >  0  is  arbitrary 
fixed,  whence 


, (x)  =  (l/i  .) 


(x)  ;  all  x  e  R  , 


is  a  p.d.f.  for  Gauss(p., - )  where 

J  V2a~. 

d  rjf 

"  cj  '  %  *  3  ~  1 . n  • 

(These  forms  are  closely  related  to  those  often  chosen  for  "fuzzy 
numbers".  See  [51].) 

Hence,  for  these  substitutions,  by  direct  computations,  for  any 


n 

2: 

1  , 

and 

-2 

d 

1 

n 

o 

n 

2 

n 

I  < 

j  =  l 

J  «  < 
n ,  6 


max  h  •  ii1  lln  •  J 

l<j<n  J  JJ 


1 

J  a 


x  -  *12 


e  2  [  °n  J  dx 


|x-//n|ka 


J"*  •  max 
is  jSn 


ii" 


-  e  dv 


(“°n> 


s  -DF-  max 

lijsnl^j 


2  — 
-a  /2a 


n  '  NT 


a  1 2 


+  4  /* 


by  standard  inequalities  (see,  e.g.,  [1],  p.  296,  #  7.1.13). 

Thus , 


c  ,/Ja ,  <  n  <  +  <»  implies 

J  j  o 


J7,a  5 


where 
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a6  =  J|  •  <°o/a) 


bx  S  Jk  •  n 
fi  o 


,2  .  2 

-a  /o0 


Hence,  if  a  is  chosen  such  that 


a  >  oq  •  log  (-rF-/7o)  , 


then 

0  =  lim  J  „  =  o  (b*n/.J"n)  . 
n ,  o  o 

n-*«> 

(See  also  [52]  for  some  related  results  involving  asymptotic 
properties  of  sums  of  fuzzy  numbers.) 

For  other  asymptotic  results  involving  $or  and  opera¬ 

tion.*,  see  Chapter  9  (E) . 

For  a  recent  paper  on  the  Strong  Law  of  Large  Numbers  for 
fuzzy  random  variables,  following  up  on  Kwakernaak's  concept  of 
fuzzy  random  variables  [148],  see  Miyakoshi  and  Shimbo  [179"]. 

(3)  Walley  and  Fine  [259]  establish  parallels  to  the  classical 
laws  of  large  numbers  linking  sampling  frequencies  of  events 
through  use  of  limsup  and  liminf  operations  with  upper  and 
lower  probability  measures,  as  well  as  generalizing  probabilistic 
concepts  of  independence  and  estimability . 

Noting  that  both  upper  and  lower  probability  measures  corres¬ 
pond  uniquely  to  random  sets  (see  Chapters  3  and  4),  an  empirical 
link  as  described  above  is  not  surprising.  Indeed,  if  a  random 
set  s  «  » ( X )  has  a  known  distribution,  the  incidence  function 
P  .  .  given  in  section  4.2  where  for  all  sets  C  e  ?(X)  of 

(S) 

interest , 


P  ...  (C)  ^  P.  (S  n  c  *  0)  , 

( S )  ^ 

is  an  upper  probability  (or  plausibility)  measure.  Similarly,  if 
S  is  actually  unknown  but  is  estimated  by  any  standard  procedure 

by,  say,  S  ,  then  again  p  „  .  will  be  an  upper  probability 

(S) 

measure,  which  if  S  is  close  to  S  in  any  reasonable  sense,  so 
will  p  be  close  to  p  -  . 

(-)(2)  <S)<2> 

A  similar  situation  holds  for  the  other  extreme:  fuzzy  or 

generalized  sets,  but  in  this  case,  the  random  set  S  involved  in 
generating  a  given  membership  function  is  not  uniquely  determined 
-  only  up  to  one  point  coverages.  Or  alternatively,  using  the 
random  variable  representation  of  a  fuzzy  ret  membership  func¬ 
tion,  one  can  choose  a  space  X  and  a  fixed  collection  a  of,  in 
general,  overlapping  compound  events  from  a  o-algebra  on  X  and  a 
given  sequence  of  independent  identically  distributed  random  vari- 


494 


Goodman  and  Nguyen 


ables  Vj.Vj,...  and  compute  by  any  standard  zero-one  averaging 

n  d  - 

procedure  the  asymptotic  probabilities  1/n  z  Pr(V.  e  c)  =  g  (C)  , 

j  =  1 

for  each  C  e  a  to  obtain  g  as  an  estimator  of  true  membership 

n 

function  g  over  a  where  each  C  is  formally  now  a  "point"  in 

. 

Of  course,  another  alternative  to  the  modeling  of  membership 
functions  is  to  use  formally  a  zero-one  random  process  which  may 
be  repeatedly  sampled,  to  represent  a  subjective  determination. 
(See  also  section  5.1.) 

(4)  Goodman  [79]  has  obtained  asymptotic  forms  for  generalized 
weighted  sums  representing  averaged  fuzzy  data  and  corresponding 
posterior  fuzzy  parameter  sets.  First  recall  the  following  con¬ 
cepts  (see  also  section  8.2): 

Definitions 

Let  and  X2  be  two  given  base  spaces  and  any 

t-norm.  For  any  A  «  sf(X1  x  X2)  ,  define  a  projection 

Pl(A)  €  s(Xl)  by  4>p  £  4>or  (<?>a(x1,x2))  ;  Xx  €  Xj  and 

1  X2€X2 

similarly  define  p2(A)  €  *(x2)  .  Then  it  follows  from  basic 
properties  of  t-norms  that  all  x^.  e  X^  ,  j  =  1,2,  there  are 
(px(A)  |  x2)  e  ?(X1)  and  (P2(A)  (  x2  )  e  7(X2)  ,  called  condi¬ 

tional  fuzzy  sets,  such  that  for  all  Xj  ,  x2 


VX!'V  -  V^unx^V  ’  ^P2(A)(X2)) 


(i) 


“  V*(p2(A)  IXj)  (x2!  '  ^Pl  ( A  )  ^  X1 


)  ) 


Pj  ( A )  is  the  ordinary 


which  are  uniquely  determined  over  supp  ( <$> A )  =  { ( x^  ,  x2 )  |  <f>A  (x.  ,x2  ) 
>  0)  ,  provided  is  strictly  increasing  in  each  argument. 

Note  that  if  A  e  ?(X1  x  X2  )  then 

projection  of  d  (p(A)|x)  ,  A  into  x^  and  (p2(A)|x^)  ,  say,  is 

the  section  of  A  in  x2  ,  given  x^  .  (See  also  section  2.3.8 

for  background.) 

It  then  follows  immediately  that  a  fuzzy  set  form  of  Bayes’ 
theorem  is  obtainable. 


Theorem  0  (Fuzzy  Bayes’) 

Let  A  e  s(X1  x  X2 )  with  <$>& 


a  strictly  increasing  t-norm. 
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Then  over  supp(A)  ,  (p1(A)|x2)  is  a  function  of  (p  (A) | • )  and 

p ^ ( A )  ,  determined  implicity  from  the  following  equations: 

♦s^lPjIA)  |x2)  Ul>  ■  *p2(A)(X2>>  |2> 

”  IXj)  |X21  ’  ♦pj(A)  (>I1  1  1  ’ 

where 

*p  (A)(x2>  ^or  (<f&(<?  (p  (A)  |x  )  (X2)  ’  ^p  (A)(xl))}  (3) 

V*i 


Remark  1 . 

An  obvious  analogy  holds  here  with  respect  to  standard  Bayes¬ 
ian  modeling.  We  can  interpret  ( p2 ( A )  |  •  )  to  be  the  conditional 

fuzzy  data  set,  p^A)  to  be  the  prior  fuzzy  parameter  set  , 

( p 1 ( A ) | x2 )  to  be  the  posterior  fuzzy  parameter  set,  and  p2(A) 

to  be  the  averaged  fuzzy  data  set,  where  x2  may  be  considered  a 

fuzzy  outcome.  (In  the  classical  Bayesian  formulation,  <f>&  =  prod 

and  is  replaced  by  an  intergral  or  sum  which  is  possibly 

weighted. ) 

In  conjunction  with  the  above  remarks,  we  will  assume  that 
the  following  general  fuz*;  sampling  experiment  holds: 

(a)  P1(A)  is  known,  but  A  itself  is  not  known  beforehand. 

(b)  (p  (A) | • )  is  obtained  empirically,  sometimes  through  human 

sources,  via  a  panel  of  "experts"  (rather  than  from  the  unknown  A 
via  Bayes'  theorem). 

Bayes'  theorem  can  then  be  applied,  with  the  above  interpre¬ 
tation,  to  obtain  the  desired  posterior  fuzzy  parameter  set.  The 

key  computation  lies  in  the  evaluation  of  <t  ,  in  equation 

P2(A) 

(3).  In  addition  to  (a)  and  (b),  assume  that  the  following  modi¬ 
fication  holds: 

F.rst  define  the  weighted  averages 

Wn ,  j  =  Vj/  WA,  j  ;  3  =  1 . n  <4> 

where  w^  i  0  are  constants,  and  the  normalized  fuzzy 

prior  set  is  given  by 


t>  ( y  . )  =  w  . 

n,j 


j  =  1 . n  ; 


(5) 
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otherwise,  <fi  is  zero. 

Tu 

n 

If  a  panel  of  experts  is  used,  each  y^  represents  expert 


j  ,  where  e  X: ,  j  = 


=  1,2 . 


(c)  Formally  replace  for  each  nil  in  eqs.  (1),(2),  p  (A) 

everywhere  by  o  and  denote  the  subsequent  value  of  <p  ...  by 
n  P2 ' A ) 

*A„  *nd  ♦(P1(A)|x2)  *(Bn|x2)  ■ 

The  next  theorem  concern  the  asymptotic  behavior  of  and 

hence  or  <p,  ,  (y.)  as  n  »  .  First  define 

n '  x2  : 

an,j(x2)  =  4)(p2(A)  |Yj)  {x2)‘wn,  j  *  (6) 

W  i  *<P2<A)|y  )'*2>  '  <7> 


<f>_  (X  )  =  r  a  .(X  )  . 

An  j-1  n'J 


Asyptotic  behavior  of  average  buzzy  data  and 
posterior  fuzzy  parameter  sets. 


Theorem  1 

Suppose  the  conditions  for  Theorem  0  hold  with  modifications 
(a),  (b),  (c),  for  each  nil,  and  suppose  the  constant  sample 

means  converge: 

lim  <p_  (x  )  ^  <p  (x  )  exists;  x  e  X  .  (9) 

n-“  A  A  Z  1 

n  <»> 

Suppose  also  for  all  nil, 

_€  1  €  2 

a-  n  £  j  <  b-n  ,  ( 10 ) 

where  0  <  a  <  b  and  >  ®  are  constants,  with 

■’l  +  €2  <  1  ■ 

Thus , 


0  <  Cn  5  Wn ,  j  5  Dn  :  J  “  1 . n  • 

0  <  cn  =  (a/b) • n~ ( 1+e l+€ 2 5  <  l 

0  <  Dn  =  (b/a)-n_(1_€l~€2)  <  1 
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where  it  is  assumed  that  n  >  nQ  =  (b/a)1^1  el  s2^  .  Note  that 

for  w1  .  =  1  ,  w  .  =  1/n  satisfies  (11)  . 
n #  j  n,  j 

Then  for  the  fuzzy  set  system  determined  by 

(<&.,<!>..  <<f>  )  =  (l-(-).  prod,  probsum)  ,  for  all  n  ,  and  all 

no  v.  ot  or 

X2  €  X2  , 


*A  <*2>  =  1  -  *  <1-«n.J(X2>>  '  (14) 

n  j  =  l  J 

<P_  (x  )  <  -log ( 1  -  <f>  (x2))  s  <p_  (x2)-(l  +  J  )  ,  (15) 

A  n  A 

n  n 

equivalently, 

1  -  exp ( -<f>-  (x2))  i  <p£  (x2)  <  1  -  exp ( -  ( <J>A  (x2)-(l  +  J  ))  (151) 

n  n  n 

where , 


J  =  - ( log ( 1  -  D  )  +  D  )/D  . 

n  '  a  n'  n'  n 


lim  Jn  =  lim  (Dn/(1  -  D  ))  =0  . 
n-K»  n-*®  1 


Thus , 


lim  <pA  (x2)  =  <|)A  (x2)  =  1  -  exp(-<p-  (x2))  ,  (18) 

n-x*>  n  °°  oo 

uniformly  in  x2  e  X2  ,  with  convergence  rate  determined  by  eqs . 
(15'),  (16)  .  Thus,  for  large  n  ,  all  y^.  e  X.^  , 


(Y-i)  —  an  H(x,)/(1  -  exp  -<pA  (x2))  . 


<B  lx,)'*j'  ^  n , j  v  2 


Theorem  2  (Central  limit  type  theorem  for  a  class  of  analytic 
operator  pairs.) 

Suppose  assumptions  (a),  (b),  (c)  and  eq.  (10)  all  hold. 

Suppose  also  that  (1  -  (• ),  $£>  3  ^e^or9an  system  where 

<}>&  has  a  generator  h  :  [0,1]  -•  [0,+®]  which  is  strictly  decreas¬ 
ing  with  h ( 0 )  =  +  ®  ,  h ( 1 )  =0  .  Then 


<p  (x  )  =  1  -  h  x(  z  h(l  -  <p  (w  a  (x  )))) 
n  2  j=l  &  nj  j  2 


i  1  -  h  (  £  h  ( 1  -  w  . )  ) 


(i) 
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Thus,  if 


z  1  -  h  1 (n- h( 1  -  D  )) 

n 


lim  (n- h( 1  -  D  ))  =  0  , 
n-*+<x>  14 


(20) 


then 

lim  <}>  (x  )  =  0  . 
n-*  -h»  n  z 

(ii)  Suppose  also  that  for  any  fixed  c  e  [0,1]  ,  <p  (z,c)  is 

analytic  in  z  about  some  neighborhood  of  0  and  that  h  is 
analytic  within  some  fixed  neighborhood  of,  and  below,  l 
Suppose,  further,  that  in  eq.  (10)  , 


=  <*(x2)  exists. 


0<  e . 


+  «  <  A 
2  2 


and  that 


lim  (  2 

n-»+<»  1  =  1 


p  .  ( x_ ) • w 
3  2'  nj 


where  for  j  =  1,2,..., 

W  3  (®VZ'a;j(X2))/dz)z=0  • 

Finally,  define 

Aq  =  -(dh(z) /dz)z=1  2  0  . 

Then,  uniformly  in  all  x,  e  X2  , 

<fA  (x2)  =  lim  <?>A  (x  )  =  1  -  h_1(A  -a(x  )) 

00  n-*®  n  u  ^ 

exists,  and  for  large  n 

9(B  |x.)(yj)  =  h_1(h(wnj)  +  h(a.(x2)))  -  h (<PA  (x2)) 
n  c  oo 


Proof : 

(i)  follows  from  the  monotone  property  of  h  .  For  (ii)- 
expand  out  in  a  power  series  the  function  h(l  -  <p&(z,c))  in  z  ' 

yielding  for  all  sufficiently  small  z  ,  for  c  =  a  ^  ( >c  2 )  at 


h  ( 1  -  <P&(  z,c))  =  Ao.pj(x2).z  +  0(Z2) 


(21) 
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0(2  )  indicating  the  remaining  series  has  powers  of  z  to  at 
least  2  .  Substitute  (21)  into  the  first  part  of  eq.  (20),  not¬ 
ing  that 

z  o ( w  2)  <  n-o(D  2)  =  o(n_(1_2(ei+e2) ! )  . 

3=1  nJ 

The  next  results  are  concerned  with  asymptotic  properties  of 
combination  of  evidence,  when  discretization  of  possibility  func¬ 
tions  is  involved,  as  occurs  typically  when  attributes  concerning 
an  unknown  papameter  vector  are  statistical  in  nature  and  are  re¬ 
presented  by  probability  density  functions. 

First,  let  B  e  ?(Rn)  ,  where  for  any  positive  integer  p  , 

A. 

the  p  discretization  4>_  :  D  -•  [0,1]  of  <t>D  is  constructed 

Bp  p  B 

typically  as 

$B  (x>  =  <|>B(x)  ;  all  x  e  D  , 


where  £  R  is  a  discrete  equally  spaced  lattice,  where  each 

Rn-cube  C  (x)  of  D  is  such  that  lim  A  (x)  =  0  and  such  that 

^  ^  p-*-H»  ^ 

for  all  integers  p  2  q  2  1  ,  D  refines  D  .  (D  may  be 

P  P 

modified  for  truncations.)  Then  for  any  t-conorm  <p  : 


Theorem  3  (Simple  limit  theorem  for  discretizations) 
If  $Qr  is  an  Archimedean  t-conorm,  then 


Iim  (V  (x)  )  =  1  . 


Remark  2 . 


If  $or  =  max  ,  then  in  general 


lim  <*>  r  ($B  (x))  <  1 

P 


The  following  theorem  extends  Theorem  2. 


Theorem  4  (Limit  theorem  as  information  granularity  approaches 
zero ) 


(a)  For  each  positive  integer  p  ,  let  be  a  discretization 
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of  R  as  before  and  let  cb  :  D  -»  [0,1]  .  Also,  let 

Cp  P 

<$>„  :  X  x  Rn  -»  [0,1]  be  continuous,  where  X  is  any  fixed  space. 

G 

(b)  Suppose  <p&  is  a  fixed  t-norm  such  that  there  is  a  fixed  6  , 

0  <  a  <  1  ,  such  that  for  each  u  s  [0,1]  ,  <5<}>& ( u ,  v )  /flv  and 

2  2 

<5  $&(u,v)/<5  v  are  finitely  bounded  as  functions  of  v  €  [0,5)  . 

(c)  Suppose  $or  is  an  Archimedean  t-conorm  with  canonical 
generator  h  ,  i.e.  (see  section  2.3.6) 

4>or(u,v)  =  1  -  h  1  ( min  (h(l-u)  +  h  ( 1  -  v)  ,  h  ( 0  )  )  ) 

for  all  u,v  e  [0,1]  ,  where  h  :  [0,1]  -*  [  0 , +® )  is  monotone 

decreasing  continuous  with  h(l)  =0  . 

Also,  assume  there  is  a  fixed  K  ,  0  <  K  <  1  ,  such  that 

2  2 

dh(v)/dv  and  d  h(v)/d  v  are  finitely  bounded  functions  of 
v  e  [  K ,  1  ]  . 

Then  for  all  0  e  X  , 

lim  (g  (0))  =  1  -  h_1 (min  (r(e),  h ( 0 ) ) )  , 

p-»+oo  P 


where 


gp(S)  y  ^or  (<V<Ve  ,X)  '  *C  (X)))  ' 

xeDp 

r(0)  =  lim  Z  (<f>c  (x)-u($  (0,x)))  , 

p-*<=°  xeD  p 
P 


u(u)  =  -(dh(v)/dv)v=i-  (u,v)/<5v)v_c 


for  all  ue  [0,1] 


Proof :  First  note  that 

g  (9)  =  i  =  h~  *  (min  (  z  h(l  -  <f>  (<t>  (0  ,x)  .  (x))),  h(  0 )  )  )  . 

p  xeD  *  G  cp 

P  y 

Then  for  each  u  ,  expand  h(l  -  $  (u,v))  as  a  function  of  v 
about  v  =  0  ,  up  to  second  order. 


Corollary  1. 


( a )  Let  f ,  R  R 

k 

density  function,  k  =  1,2,. 


be  a  bounded  continuous  probability 
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(b)  Let  q  be  any  fixed  positive  integer. 

(c)  Suppose  $  is  any  fixed  t-norm  such  that 

vt  ,  X 


>  rq  =  <3X.1(U1 . Uq)/aV  '  • 


3U  )  .  . 

q  u  =■ • =u  =0 
H  1  q 


(d)  As  before,  denote  D,  as  the  p  discretization  (with 

k ,  p 

mk 

all  previous  properties  holding)  for  R  ,  so  that  letting 

fk  p*xk*  ~  fk(xk)‘^k  p(xk!  ’ 

for  all  x,  «  D,  ;  k  =  1,2,...  ;  p  =  1,2,...,  with 

k  k ,  p 


^  _ 

X,  €  D, 
k  k ,  p 


f,  (x,  )  =  1  , 
k ,  p  k 


i.e.,  f,  is  a  probability  function. 

K  ,  P 

In  turn,  define 

-*q  4  <xj,xa . V  *  Dpq>  4  • 

fpql(iql  4  ♦s.l'L.p"1!1 . 'q.p'V  ' 

(e)  Next,  suppose  2  is  a  fixed  t-norm  such  that  there  is  a 

fixed  a,  0  <  0  <  1  such  that  for  each  u  e  [0,1]  , 

2  2 

2<u,v)/<?v  and  <3  (u,v)/<3  v  are  finitely  bounded  as  func¬ 
tions  of  v  «  [ 0 , a )  . 

(f)  Let  X  be  any  space  and  let 


:  X  x  r 

Gq 


[0,1] 


be  any  continuous  function,  where 


_(q)  d 

m  —  m „  +  *  *  * 

1  q 


(g)  Let 
ator  h  , 


be  any  Archimedean  t-conorm  with  canonical  gener- 


$or(u,v)  =  1  -  h  (min  (h(l  -  u)  +  h ( 1  -  v) ,  h ( 0 ) ) )  , 


for  all  u,v  «  [0,1]  ,  where  h  :  [0,1]  -«  [0,+»)  is  monotone  de¬ 

creasing  continuous,  with  h ( 1 )  =0  . 

Also,  assume  there  is  a  fixed  K  ,  0  <  K  <  1  ,  such  that 
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2  2 

dh(v)/dv  and  d  h(v)/d  v  are  finitely  bounded  functions  of 
v  e  [K.l]  . 

Then,  for  all  e  e  X 


lim  (g‘q)(e))  =  1  -  h  1(min  (r  (a),  h ( 0 > ) )  , 
where  for  all  9  €  X  , 

1  v  (••*„>■  ‘'ql<sq>11  ■ 

( X  « D  q  )  q 

q  p 


=  rq 


/I 


<a'5a>)  •  77  fk(Sk)d5v  - 


x  eR 

~q 


m 


(q/l  q 


k=l 


u(u)  =  - ( dh( v)  /dv)  v=1  •  (<5$&  2(u,v)/av)v=0  , 


for  all  u  e  [0,1]  . 


Remark. 

Frank's  Archimedean  family  (see  section  2.3.6)  can  be  used  to 
supply  <t>&  i  >  tt>&  2  •  ^or  *or  Theorem  4  or  Corollary  1. 


8 . 5  Subjective  probability  and  general  logical  systems. 

There  have  also  been  a  number  of  papers  attempting  to  in¬ 
terpret  fuzzy  set  concepts  within  the  purview  of  subjective 
probability  theory.  (See,  e.g.,  Nalimov  [180],  Hanrahan  [105], 
[106].  Nativig  [182],  Hisdal  [113]). 

In  particular,  consider  the  important  independent  work  of 
Watanabe  [268]  and  Giles  [70  -  73].  Both  have  developed  a  basic 
mathematical  theory  for  fuzzy  sets,  in  a  somewhat  different  con¬ 
text  from  Zadeh,  in  which  they  are  concerned  also  with  an 
empirical  basis  for  the  numerical  evaluation  of  the  membership 
functions,  the  various  logical  connections,  and  other  relations 
involved  in  fuzzy  set  models.  Consider  the  following: 

Let  X  be  a  fixed  space  of  points  or  objects  and  ^(X)  a 

fixed  space  of  properties,  attributes,  predicates  or  fuzzy  sets. 
Define  for  any  CSX  and  A  e  ^ ( X )  .  (C  s  A)  or  A ( C )  as  that 

atomic  or  primitive  proposition  representing  compatability  or 
truth  that  C  has  property  or  attribute  A  or  C  is  a  subset  of 
fuzzy  set  A  .  Introduce  also  the  operators  "not,"  "or" 
which  formally  have  the  usual  classical  logical  interpretations, 
and  then  form  the  class  of  all  compound  propositions  as  the 
well-formed  strings,  under  not  ,  &  ,  or  ,  of  the  primitive 
propositions,  so  that  at  least  a  Boolean  algebra  s  results. 
(See  also  2.2.)  A  similar  result  holds  for  all  well-formed 
strings  of  predicates.  Watanabe  restricts  himself  to  the  special 
case  of  C  =  (x)  ,  x  e  X  ,  resulting  in  fuzzy  set  membership  forms 
(x  e  A)  ,  A  e  ?1(X)  and  fuzzy  set  membership  functions  <f>A(x)  • 
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In  addition,  he  uses  first  a  fundamental  composition  operator 
on  to  develop  his  logical  operators  over  a  subspace  *(X) 

of  ^(X)  ,  where  the  composition  operator  is  idempotent.  Both 

Watanabe  and  Giles  approach  the  evaluation  of  a  proposition 
throuth  the  concept  of  a  "test"  in  an  extended  sense.  Watanabe 
stresses  more  the  nonsub jective  probability  ideas  that  in  effect  - 
again  restricting  himself  to  (x  «  A)  -  <f>A(x)  <  the  fuzzy  set 

membership  level  of  x  in  A  ,  is  to  be  interpreted  not  as  the 
truth  (x  e  A)  -  (x  e  A)  being  either  true  or  not  -  but  rather  as 
the  probability  of  observing  (at  least  theoretically)  that  x  e  A 
or  equivalently,  the  probability  that  zero-one  r.v.  V  .  =  1  , 

X  ,  A 

with  1  -  $  (x)  =  probability  (V  .  =  0)  ,  where  $.(x)  ,  can  be 

A  X  /  A  A 

estimated  by  a  test  procedure  which  counts  the  percentage  of  suc¬ 
cesses.  If  the  test  for  a  proposition  can  not  be  repeated  -  due, 
for  example,  to  a  time  factor  t  occurring  for  A^  €  sp  ^  ( X ) 

where  for  t '  *  t"  ,  in  general,  A^,  *  A^,,  -  the  test  is  called 

dispersive  (as,  e.g.,  in  a  micro- level  or  single  sample  situa¬ 
tion),  otherwise  the  test  is  called  non-di spersive .  Giles  differs 
from  Watanabe  ir.  that  he  stresses  the  subjective  and,  in  general, 
non-probabilistic ,  i.e.,  possibilistic  interpretation:  Using  a 

pragmatic  approach  through  a  "rational"  betting  scheme,  he  first 
considers  prob(A(C))  ,  which,  when  it  exists  is  the  classical 
Savage  Baysian  subjective  probability  that  A(C)  occurs,  relative 
to  a  test  individual.  In  general,  however,  prob(A(C))  cannot  be 
obtained  but  rather  a  probability  interval  , 

[low  prob  A ( C )  ,  upp  prob  (A(C))]  >  is  identified  with  upp  prob 

A(C)  .  For  compound  propositions,  Giles  establishes  a  dialogue 
procedure  for  evaluation.  The  result  is  that  in  addition  to  the 
numerical  evaluations  being  obtainable  by  this  procedure,  a  par¬ 
ticular  specialization  of  Giles'  system  is  seen  to  be  the  same  as 
Lukasiewicz  t^-logic  (both  semantically  and  syntactically)  [214]. 

On  the  other  hand,  Watanabe 's  system  differs  considerably  in  form 
(q.v.).  Giles  obtains  a  number  of  interesting  relations  between 
probabilities,  possibilities,  and  Zadeh's  original  concept  of 
possibilities,  which  may  be  proved  with  a  set  -  theoretic  context 
not  dependent  upon  the  original  betting  scheme.  Some  of  these 
results  are  similar  to  other  relations  developed  between  various 
definitions  of  uncertainty  measures.  (See  Dubois  and  Prade  [51], 
Chapter  II.  5  for  a  more  comprehensive  listing.  See  also  Banon 
[11],  Kruse  [146]  and  Puri  and  Ralescu  [206]  as  well  as  the 
extensive  work  of  Klement  and  Schwyhla  [142]  and  Klement  et  al . 
[141]  in  developing  fuzzy  probability  measures  and  their  relations 
to  classical  probability.  See  also  section  3.3.) 

(i)  A  (Giles)  possibility  function  p  may  be  defined  by 

P  :  a  -*  [0,1]  ,  where  $  may  be  identified  with  some  Boolean 

algebra  of  subsets  of  X  ,  say,  (by  Stone's  Representation 
Theorem)  with  p  being  nondecreasing  w.r.t.  subset  inclusion, 
p(0)  =  0  ,  p ( X )  =  1  and  for  all  A,B  e  a  , 

max ( p ( A ) , p ( B ) )  <  p(AUB)  S  p ( A )  +  p(B)  . 

(ii)  Every  finite  additive  probability  function  is  a  (Giles) 
possibility  one. 

(iii)  The  argument-wise  (w.r.t.  sets  A  e  a)  supremum  of 
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any  collection  of  (Giles)  possibility  functions  is  also  a  (Giles 
possibility. 

(iv)  Any  (Giles)  possibility  function  is  the  argument-wise 
supremum  of  a  class  of  (finitely  additive)  probability  functions. 

(v)  A  Zadeh  possibility  function  p  :  a  -♦  [0,1]  where  for 
all  A , B  e  a  , 

p(A  u  B)  =  max(p(A) ,  p .  ) )  , 

is  of  course  a  (Giles)  possibility  function  and  for  any  fuzzy  set 
membership  function,  :  X  -•  [0,1]  ,  sup  <|>A(x)  is  a  Zadeh 

xe  B 

possibility  function  of  B  e  a  . 

See  also  Smets  (e.g.  [243])  for  a  subjectivist  -  betting 

approach  to  fuzzy  set  modeling. 


Remark. 


Subjective  probability  and  possibility  are  a  necessity  in 
general  decision  analysis. 

In  expert  systems,  estimates  of  uncertainty  quantities  of 
interest,  e.g.,  certainty  factors  in  MYCIN,  weights  of  evidences 
in  clinical  decision-support  systems  (see,  e.g.,  Spiegelhalter  and 
Kni 1 1-Jones ,  [246]),  are  obtained  subjectively.  The  subjectivity 
is  also  dominant  in  applications  of  possibility  theory.  However, 
possibility  measures,  as  proposed  by  Zadeh  to  model  expert  know¬ 
ledge,  differ  from  other  uncertainty  measures  in  particular,  from 
subjective  probability  (e.g.,  Fine,  [58],  Good,  [76])  by  their 
interpretation  and  their  associated  calculus.  Different  modeling 
processes  lead  to  different  mechanisms  for  combining  of  evidence. 
Thus,  for  example,  when  probabilities  are  used  to  express  prior 
knowledge,  then  Bayes'  formula  is  used  to  update  information  or  to 
combine  evidence.  On  the  other  hand,  in  statistical  systems,  the 
concept  of  weight  of  evidence  (Good,  [76]),  can  be  used  to  re¬ 
present  concepts  such  as  "doubt,"  "ignorance"  and  "conflict  of 
evidence".  (See  Spiegelhalter  and  Knill-Jones  [246].)  More 
precisely,  following  Good  [76],  the  information  concerning  a  pro¬ 
position  A  provided  by  a  proposition  B  ,  given  a  proposition 
G  ,  is  defined  to  be: 


log 


Pr ( A  |  B , G ) 
Pr ( A  |  G) 


The  weight  of  evidence  in  favor  of  an  hypothesis  H1  as  compared 
with  hypothesis  H2  provided  by  the  evidence  E  ,  given  G  ,  is 
defined  by: 

Pr ( E  |  H  ,  G) 

W(H  /H  :  E  |  G)  =  log  - = -  • 

Pr ( E  I  Hz,  G) 

Note  that,  both  information  measure  and  weight  of  evidence  have 
the  additivity  property. 

The  situation  where  expert  knowledge  should  be  taken  into 
account  is  very  common  in  systems  engineering.  For  example,  in 
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Structural  Engineering  (e.g.,  Au,  et  al . ,  [8]),  due  mainly  to 
insufficient  statistical  data,  the  ignorance  of  structures  and  the 
doubt  in  the  actual  hypothesis  considered  by  a  design  engineer, 
field  observations  of  experts  should  be  used  in  any  final  decision 
analysis.  More  precisely,  the  reliability  of  engineering  struc¬ 
tures,  the  factor  of  safety  and  the  margin  of  safety,  which  are 
defined  in  terms  of  the  imposed  load  effects  S  and  the  structure 
resistance  R  (considered  as  random  variables),  are  appropriate 
measures  of  reliability.  The  probability  of  structure  failure  Pf 

can  be  computed  if  the  joint  probability  distribution  of  (S,R) 
is  known.  However,  in  practice,  often  this  is  not  the  case!  On 
the  other  hand,  the  degree  of  "doubt"  for  the  hypotheses  consi¬ 
dered  in  computing  Pf  should  be  assigned  somehow  to  reflect  this 

type  of  system  uncertainty.  The  value  of  Pf  is  then  used  in  the 

optimization,  say,  of  the  utility  function  relating  the  risk  of 
failure  to  economical  and  social  consequences .  This  decision 
making  problem  is  clearly  more  general  than  traditional  statist i- 
cal  decision  problems  due  mainly  to  the  kind  of  data  available  and 
the  type  of  uncertainty  measures  involved.  A  procedure  for 
computing  Pf  is  suggested  in  Au,  et  al .  ,  [8]  where  subjective 

expert  knowledge  is  expressed  in  the  form  of  "judgmental  factors" 
reflecting  the  uncertainty  in  assigning  a  joint  probability 
distribution  to  (S,R)  .  The  question  is  then:  how  to  effectively 
assign  these  judgmental  factors  in  a  given  situation,  or,  to  put 
it  in  another  way,  how  to  model  or  encode  expert  knowledge?  Once 
these  factors  have  been  subjectively  assigned  by  an  "objective" 
procedure  or  by  ad  hoc  quantification,  we  are  facing  two  different 
types  of  uncertainty  measures  in  a  single  decision  problem. 

Some  of  the  above  concepts  will  also  be  found  in  Chapter  9, 
where  an  application  to  data  association  is  considered. 
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CHAPTER  9 


AM  APPLICATION  OF  GENERAL  LOGICAL  DECISION  THEORY 
TO  A  PROBLEM  OF  DATA  ASSOCIATION 


This  chapter  contains  a  detailed  application  of  our  theory 
developed  in  previous  chapters  to  a  problem  of  data  association. 


A.  Description  of  the  problem. 

Too  often  in  the  past,  linguistic  based  information  has  been 
neglected  in  favor  of  "hard"  numerical  or  statistical  data.  How¬ 
ever,  with  the  advent  of  rule-based  systems  and  new  approaches  to 
the  modeling  of  uncertainties,  both  literal  and  statistical  infor¬ 
mation  may  be  treated  from  a  common  basis  and  effectively  combined. 

The  following  example  illustrates  how  general  logical  systems 
may  be  used  to  model  linguistic  and  numerical  information  in  order 
to  estimate  a  parameter  of  interest. 

Although  the  problem  considered  here  will  be  the  target  data 
association  one,  obvious  changes  in  notation  and  forms  yield  a  simi¬ 
lar  model  for  medical  or  fault  diagnoses.  The  results  are  based  on 
the  work  developed  in  [80],  [77],  [82],  [83],  [86],  [91"],  [91111]. 

By  a  target  history,  we  mean  a  collection  of  data  concerning  a 
single  target  of  interest  which  is  sorted  in  two  different  ways:  by 
time  and  by  attribute.  Time  indices  correspond  to  report  sampl¬ 
ings  and  could  vary  from  a  single  report  to  a  large  collection  of 
reports.  Examples  of  attributes  include:  geolocation  ( 2  -  or  3  - 
dimensional  positions  with  possible  additional  parameters:  velocity 
or  even  acceleration  information),  classification- type ,  on-board 
sensor  characteristics,  hull  lengths,  or  even  visual  descriptions, 
the  latter  resulting  from  an  actual  sighting  of  a  vessel.  This  may 
also  be  interpreted  in  linguistic  form  such  as  "irregular  hull 
appearance",  "oblong  shape  like  a  type  Q",  "is  approximately  x 
yards  long",  "appears  to  be  flying  a  dark  triangular-shaped  flag  - 
maybe  Antartica",  or  "has  two  or  perhaps  three  missile  launchers 
on-board" .  Other  attributes  that  may  be  present  at  various  times 
include:  emitter  fundamental  frequencies  of  equipment  (numerical), 
tentative  classification  by  intelligence  sources  (numerical  or 
literal),  mode  of  operation  (literal),  as  well  as  various  measure¬ 
ments  of  system  parameters  obtained  by  various  sensor  systems. 

Each  of  these  attributes,  of  course,  could  vary  as  a  function  of 
time.  The  domain  of  possibile  values  for  each  such  attribute  is 
assumed  known  and  may  vary  considerably. 

For  example,  the  domain  for  geolocation  could  be 

dom(geo)  ={(x,z)  |  x  e  a,  X  €  ?}  , 
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2 

where  a  s  R  is  some  region  of  interest  and  »  is  the  set  of  all 
positive  definite  2  by  2  matrices  or  equivalently  ellipses  of 
some  fixed  confidence  level.  In  general,  a  and  a  are  time  de¬ 
pendent  . 

The  domain  for  classification  could  typically  consist  of  the 
iaoel&  ,  C2 ,  .  .  .  ,  v-m  »  which  can  weii  represent  overlapping  stla. 

The  domain  for  frequency  could  be  a  range  of  numerical  values  in 
hertz,  while  a  typical  visual  description  attribute  could  be  ob¬ 
served  ship  flag  color  with  domain  consisting  of  {red,  dark  red, 
orange,  yellow,  pink,  white,  ...}  ,  or  the  attribute  representing 
ship  length  could  be  in  pure  linguistic  terms:  very  short,  short, 
long,  about  300  feet,  very  long,  etc.  Although  the  possible  domain 
values  for  each  attribute  could  also  vary,  let  us  assume  that  rela¬ 
tive  to  some  common  fixed  reference  time  -  say,  to  the  present  -  all 
information  has  been  suitably  updated  for  both  target  histories  and 
all  attribute  domain  values,  with  no  missing  attribute  for  either 
target  history. 

A  track  history  may  contain  as  little  as  a  single  report  or  it 
may  consist  of  a  large  and  growing  (in  time)  number  of  reports  as¬ 
sembled  from  one  or  more  mechanical  or  human  sensor  sources  over  an 
extended  period  of  time.  Often,  due  to  independent  information 
sources,  two  or  more  track  histories  may  be  established  which  are 
suspected  (or  actually  equated)  of  representing  in  reality  the  same 
target  or  platform.  Most  commonly,  this  is  based  on  either  simple 
unimpeachable  identifications  or  upon  reasonable  matching  of  geo¬ 
location  information,  which  may  be  further  enhanced  by  a  comparison 
of  other  attributes.  All  comparisons  are  carried  out,  of  course, 
with  respect  to  suitable  common  updatings  (or  smoothings  or  pre¬ 
dictions)  of  the  reports  in  the  track  history  files.  In  the  past, 
both  the  preliminary  screening  phase  and  the  more  refined  final 
decision  phase  In  determining  whether  two  or  more  track  histories 
"correlate",  i.e.,  associate  to  the  same  platform,  or  not,  were 
based  typically  upon  a  combination  of  heuristic  and  statistical 
hypotheses  testing  procedures  with  emphasis  placed  upon  geolocation 
attribute  information.  In  turn,  If  two  or  more  histories  were 
decided  to  represent  the  same  object,  the  geolocation  information 
was  combined  and  most  often  some  variation  of  the  well-known  Kalman 
filter  procedure  was  used  for  the  ensuing  tracking  aspect.  If  the 
track  histories  were  decided  as  being  associated  with  distinct  tar¬ 
gets,  similar  tracking  procedures  were  carried  out  for  the  separate 
tracks.  (For  further  background,  see  the  extensive  surveys  of  the 
field  made  in  [87]  and  [273].  In  addition,  [84]  presents  a  general 
mathematical  model  of  the  correlation  problem,  again  with  emphasis 
on  the  geolocation  aspect.  See  also  Bowman  [23].) 

For  simplicity,  consider  two  target  histories,  each  arising 
from  separate  data/sensor  sources.  The  more  general  situation  of  n 
target  histories  could  be  handled  either  by  a  direct  generalization 
of  the  procedure  developed  here  or,  pairwise,  by  the  establishement 
of  a  confusion  -  or  likelihood  matrix  of  possible  associations  (see 
Table  0)  in  which  a  number  of  standard  algorithms,  e.g.,  [146‘]  are 
available  for  converting  to  an  assignment,  i.e.,  permutation  matrix. 
This  is  also  related  to  the  Birkhoff  decomposition  theorem  which 
exhibits  the  (nonunique)  probability  distributions  of  a  random  as¬ 
signment  matrix,  whose  expectation  is  the  given  "confusion"  matrix, 
[.178]  . 
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Entries  represent  likelihoods  or  possibilities  of  cor¬ 
relation  between  arbitrary  pairs  of  track  histories. 
Clustering  techniques  may  be  used  to  group  these  his¬ 
tories.  Tentatively , two  groups  here  may  be  formed  by 
inspection:  {1,2}  and  {3,4,5}  . 

Table  0 


An  Example  of  a  Correlation  or  Confusion  Matrix 


Table  1  illustrates  symbolically  a  tableau  comparing  data  by 
attribute  for  two  given  track  histories. 


Attribute 

Track  history  1 
updated  to  time  t 

Track  history  2 
updated  to  time  tQ 

A, 

2<D 

*(2) 

1 

i 

1 

A_ 

2<1} 

2<2) 

2 

2 

2 

A_ 

S(l) 

2(2) 

3 

3 

3 

A 

2d) 

2(2) 

m 

m 

m 

Table  1 


Tableau  of  Observed  Attribute  Data. 
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In  Table  1,  the  values  ,  S$.2)  €  D  ^  3  dom(A^)  ,  represent 

observed,  measured,  or  reported  values,  j  =  l,...,m  . 

Symbolically,  we  may  write 


i  =  (1(1) ,^(2) ) 


S<2> 


^(2) 

m 


5(2) 

m 


(1) 


It  should  be  noted  that  any  or  all  of  the  attributes  a  ,  .  .  .  ,A 

could  themselves  represent  compound  forms  of  simpler  attributes. 
For  example,  A1  could  represent  jointly  geolocation  and  frequency 

or  geolocation,  frequency  and  ship  length,  etc.  Table  2  illustrates 
these  ideas  further  by  displaying  a  simulated  typical  data 
comparison  by  attributes.  [Due  to  paging  error, Table  2  is  located  on  page 
542.) 


B.  Error  tables. 

In  addition,  it  is  assumed  that  associated  with  each  attribute 
Afc  is  a  known  error  table  P^  of  size  n^  by  n^  where 

n^  =  card(dom(Ak)  ,  where  for  any  two  values  Z£  '  Zk  €  dom(Ak)  , 
the  entry 

Pk<Zk  '  Zk> 

=  probability  or  possibility  that  Z£  is  the  true  value  given 
is  observed  value.  (2) 

Thus,  for  any  Z £  e  domain  (Afc)  ,  Pk(- |Z£)  is  either  a  probability 

function  or,  more  generally,  due  to  possible  overlapping  of  events, 
representing  values  in  dom(A^)  (to  be  made  clearer  below), 

( •  |  zj^ )  is  a  possibility  function,  k  =  l,2,...,m  .  If  P^  repre¬ 
sents  a  probability  function  and  perfect  errorless  measurements  were 
always  made,  then 

Pk  =  In.  <nk  V  ’  (3) 

k 

identity  matrix,  for  some  k,  k  =  l,...,m  . 

In  general,  any  P^  can  be  obtained  in  one  of  two  ways;  either 

analytically  or  subjectively.  The  former  typically  occurs  for 
geolocation,  frequency,  and  other  attributes  where  the  number  of 
contributing  factors  may  be  determined  so  that  all  relevant  physical 
and  statistical  relations  may  be  modeled  reasonably  faithfully. 
Thus,  geolocation  has  typically  associated  with  it  multivariate 
Gaussian  error  distributions;  on  the  other  hand,  frequency  can  have 
a  more  complicated  analytic  form  for  its  error  table  derived  from 
e.g.,  dopplar  shift  considerations,  using  transformation  of  proba- 
bibility  techniques.  Of  course,  all  such  distributions  must  be 
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suitably  discretized  and  localized  over  the  regions  of  interest.  In 
these  analytic  cases,  the  entire  error  tables  could  be  tabu¬ 
lated  or  more  conveniently  written  in  formula  form  with  evaluations 
held  back  until  needed. 


On  the  other  hand,  if  attribute  A^  represents  relatively 

complex  concepts  which  cannot  be  easily  treated  by  an  encompassing 
statistical  model  -  such  as  involving  many  contributing  factors 
which  either  cannot  be  fully  quantified  and/or  where  there  is  only 
incomplete  knowledge  of  the  actual  conditional  and  joint  probabili¬ 
ties  involved,  then  in  such  a  case,  may  be  more  appropriately 

and  simply  obtained  by  querying  directly  a  parcel  of  experts.  In 
this  situation,  P^  appears  in  numerical  form  -  although  the  domain 

of  may  be  linguistic  or  numerical  in  nature.  The  panel  of 

experts  act  as  human  integrators  of  knowledge  in  producing  P^  . 

See  Table  3  for  a  simple  numerical  example  of  an  error  table. 


Z2 

7 11 

A 

B 

C 

D 

E 

F 

Z2  I 

I 

1 

A  j 

0.9 

0.4 

0.7 

0.3 

0.0 

0.5 

B  1 

0.3 

0.8 

0.4 

0.5 

0.1 

0.5 

C  j 

0.6 

0.4 

0.9 

0.7 

0.8 

0.6 

D  1 

0.2 

0.5 

0.7 

1.0 

0.9 

0.2 

E  | 

0.0 

0.0 

0.7 

0.9 

0.9 

0.4 

F 

0.4 

0.3 

0.5 

0.1 

0.3 

0.1 

Entries  represent 

the  possibilities  that  a 

value 

Z2 

is 

true 

given  value 

Z2 

reported 

or 

observed , 

for 

the 

attribute 

Class.  Note 

the 

absence  of  summing  to 

unity 

for 

rows  or 

columns  due 

to  overlapping 

sets 

of  elementary 

events 

each 

class  represents. 


Table  3 


A  Typical  Example  of  Error  Table  Pk  Obtained  from 
Directly  Pooling  a  Panel  of  Experts  for  A4  =  Class. 


Let  us  illustrate  both  the  analytic  and  subjective  modeling  of 
P^'s  by  the  example  of  classification: 

Consider  first  the  situation  where  classification  can  be 
obtained  analytically: 

For  simplicity,  consider  the  situation  described,  more 
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generally,  in  section  5.2  (F)  :  Let  G1  -  {a,b,c}  ,  G 2  -  {I, II)  , 

G3  =  {1,2}  ,  be  the  domain  sets  of  the  only  factors  considered  for 
possible  classes.  For  example,  G^  could  represent  lengths,  , 
weights  ,  heavy ,  light),  and  Gg  ,  snapes  (of  type  1  or  type  2). 
Defino  then  the  classes  C1,...,Cg  by  the  following  table: 


Class  C  . 

J 

G2j=G2  values 

for  C  . 

3 

G3j=G3  values 
for  Cj 

C1 

a ,  b 

I 

1,2 

C2 

b,  c 

I 

1 

C3 

a ,  c 

II 

1,2 

C4 

a,b 

II 

1,2 

C5 

a ,  b ,  c 

II 

1  1 

C6 

c 

I 

2  1 

1 

_ L 

Table  4 

Definitions  of  Classes. 


Define  the  following  elementary  events  -  possible  triples  of  values 
from  >  G2  '  G3  determine  the  classes: 


=  (a, 1,1) 

x4  =  {a, II, 2) 

x?  =  (b.11,1) 

x10  =  (c,I'2) 

x2  =  (a, I, 2) 

x5  =  (b.1,1) 

X8  =  ( b , II , 2  ) 

x:1  =  ( c , II , 1 ) 

x3  =  (a, 11,1) 

xg  =  (b , I , 2 ) 

xg  =  (c,I,l) 

x12  =  { c , II , 2 ) 

Table  5 


Multiplicative  Definitions  of  Elementary  Events. 


Thus  we  can  tabulate  for  each  class  which  elements  are  in  it, 
and,  conversely,  for  each  element,  which  classes  contain  it: 
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Class  C  . 

J 

Elements  in  C^ 

Class  C  . 

j 

Elements  in  C ^ 

C1 

X1 <  x2 '  x5 '  x6 

C4 

x3,  x4,  x7,  x8 

C2 

X5 '  x9 

C5 

x3,  x7'  X11 

C3 

x3,  x4,  x11#  x12 

C6 

X10 

Table  6 

Elementary  Events  in  Each  Class. 


Here,  X  =  {Xj ,x2 , . . . ,x12>  ,  a  =  {Cj,  .  .  .  ,Cg}  £  9*  ( X )  ,  and  the 

collection  of  all  classes  containing  x.  is  c,  .(.a)  . 

J  \  X  .  / 


Table  7 

Classes  Containing  Elementary  Events. 


Next,  let  Vj  be  a  random  variable  over  corresponding  to  the 

true  value  of  factor  j  and  a  random  variable  also  over 

corresponding  to  the  observed  value  of  factor  j  ,  for  j  =  1,2,3. 
Suppose  also  that  (V^.Vg.Vg)  are  a  mutually  statistically  inde¬ 
pendent  triple,  as  are:  {^i'^2'^3)'  <Vl'^2'^V'  (^l'V2'V3)*  and 

(WV3>  ■  Suppose  further  that  the  probability  functions 

p(Vj|^j)  and  p(^j)  ,  j  =  1,2,3  ,  are  all  known,  where  we  use  the 

convention  of  identifying,  where  necessary,  random  variable  and 
their  probability  functions  through  their  typical  outcomes.  Define 

random  variable  V  =  (V1>V2'V3^  corresponding  to  the  true  joint 
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factor  values  determining  the  true  class,  and  similarly, 

V  =  ( V ^ , Vj , Vg )  for  the  observed  (or  reported)  class. 


follows  that: 


V  €  c, 


Then  it 


n  z  p ( v  .  | v  .  =  t ) • p ( v  .  =  t)  /  n  z  P(v.  =  t; 


3=1  t.Gjk 


j=l  t«Gjk 


p(Cffl  is  true|Ck  is  observed)  =  p(VeCm|$eCk)  =  $B ( cm ! ck ) 


1  =  1  ft«G  .. 

J  jk 

t  *€G  . 
jmJ 


p(V  =  t  ‘  |  V  .  =  t )  •  p  ( V  .  =  t)  /  Jl  Z  p  ( V  .  =  t) 

J  J  J  J  3  =  1  t€G..  J 


Thus,  p ( C  true|C,  observed)  is  a  computable  function  of  C  for 
m  k  m 

each  in  and  may  be  interpreted  as  the  fuzzy  set  membership 

function  or  possibility  function  for  possible  choices  of  which 

class  gave  rise  to  the  observation  ,  for  any  fixed  k  , 

k  =  1,2,... ,6  .  These  functions  are  generated  by  the  conditional 


random  variables 
compound  events 


^  e  ck)  evaluated  over  the  collection  of 
In  turn,  the  possibility  functions 


also  represented  as  the  one  point  coverage  functions  of  the  random 
subsets  (S  |  ^  e  Ck )  of  a  which  represent  the  possible  inter¬ 

actions  of  the  classes  k  =  1,2,..., 6.  (See  Chapter  5.)  Here,  for 

any  k  ,  a  typical  outcome  of  (S  |  ^  e  C.  )  is  some  C  >(-*■)  ,  the 

K 

collection  of  all  classes  C^,  C2,  ...  such  that  they  interact  with 

respect  to  x^  ,  i.e.,  contain  Xj  .  Table  6  presents  the  ten 

distinct  such  collections  of  classes  making  up  the  range  of 
( S  |  ^  €  Ck )  .  The  probability  function  for  this  random  subset  is 

obtained  from 


(S  |  $  e  Ck)  = 


C  (V)  {'*) 


P ( S  =  S'  I  $  €  Ck)  =  p  (V  =  V- )  ,  (7) 

for  any  outcome  S'  of  S  corresponding  to  V’  of  V  ,  which,  in 
summary,  imply 
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€  C  )  =  p  ( C 

K  m 

€  S  | 

%  €  Ck)  =  p(V 

e  C  |  ft  < 
m  ' 

CO 

U 

for  all  C  e 

A  =  (C.  . 

for  k  =  1  ,  .  .  , 

.  . 6  .  where  the  Droba- 

bility 

m 

functi 

’  1  6 ' 

on  for  random  set 

o 

(S  1  v  6  ck) 

is  given 

in  (see  also 

Table 

7 )  eq. 

(9)  . 

p  ( s 

“  <C1> 

1  V  e  ck) 

= 

z 

i=l , 2 , 

( P  ( V  =  X  1  V 

6 

€  ck))  *  ] 

P(S 

=  (C3,c 

4'V  1  * 

€ 

Ck>  “ 

p(V  =  X3  |  V  € 

ck}  - 

P(S 

=  <c3,c 

4>  1  V  € 

V 

=  p(V 

# 

(9) 

P(S 

=  <C1  ,c 

2>  1  $  « 

ck> 

=  P(V 

-  -5  1  ?  e  Ck> 

i 

P(S 

=  <V 

1  V  e  Ck) 

= 

p(V  = 

-10  1  5  •  V  ■ 

etc .  , 

which 

may  be 

evaluated 

through 

eq .  ( 4 )  . 

Note  that  the  function  4>g{-  I  ^  €  C^)  ,  in  general,  does  not  sum 
to  1  over  a  ,  since  the  outcomes  Cj,...,Cg  are  compound  events 
which  are  not  all  disjoint  for  random  variable 

(V  |  V  «  Ck)  ,  Sc  =  .  Thus,  we  should  not  normalize  <fB  to 

maSce  it  a  probability  function:  it  is  a  possibility  function  even 

though  it  was  obtained  analytically. 

Consider  now  the  converse  of  the  first  situation  : 

We  know  the  classes  of  relevance  a  =  {C,,...,C_>  ,  but  we  do 

1  b 

not  know  all  the  contributing  factors  and/or  we  do  not  have  a  handle 
on  all  the  required  probability  functions  involved  in  the  factors; 
but  we  do  have  before  us  a  panel  of  experts  who  will  give  the  possi¬ 
bilities  of  true  class  values  given  observed  ones.  To  simplify 
notation,  assume  that  a  particular  C^  is  chosen  and  all  results, 

as  above,  are  conditioned  upon  the  event  V  e  .  Suppose  then  the 

possibilities  of  a  particular  class  being  the  true  one,  given  Ck 

is  observed,  is  determined  by  the  panel  to  represent  in  effect  the 
fuzzy  subset  B  of  a  which  is  given  through  the  membership  (or 
possibility)  function  <}>_  :  a  [0,1]  ,  where  without  loss  of 

O 

genera  1 i ty 

0  <  ^(Cj)  <  *B(C2)  <•••<  *B(C6)  =  1  . 

First  recall  the  fundamental  results  previously  mentioned  in 
sec t ion  5.2  ( F )  : 

There  are  random  subsets  S  of  a  such  that 

$R(C  )  =  Pr ( C  e  s)  ;  all  C  e  a  . 
d  m  m  m 


(  10) 
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Guided  by  (also  previously  mentioned)  the  results  concerning  minimal 
entropy  or  nested  forms,  we  may  choose,  e.g., 


S  =  SU(B)  =  ^g1 [U , 1 ]  , 


(11) 


U  uniformly  distributed  random  variable  over  [0,1]  .  In  this 

case,  the  probability  function  for  S  is,  by  defining  for 
m  =  1 , ....  6  , 

c<"1  a  «Vc»+i . V  '•  V°0>  - 0  '  |12> 

p(S  =  C(m))  =  $B(Cm)  -  ^(C^),  m  =  1 - ,6  .  (13) 

Thus , 

rng(S)  =  (C(m)  |  m-l,...,6)  £  9(a)  ,  (14) 


-*  =  a ■  ,  and 


c{c  (rng(S))  =  (a  |  Cm  e  a  £  rng(S)} 
m ; 

=  (C(1)  ,•  •  •  ,C(m) )  ;  m  =  1 _ ,6  ,  (15) 

Let 


g(S)  =  u  C{c  (rng(S))  =  (C(2) . C(6))  .  (16) 

m=l  m 

Noting  here  for  S  :  n  -»  9  (a)  ,  we  may  choose  n  =  [0,1]  ,  with 

U(u)  =  w  ,  for  all  wen.  Then  it  follows  that: 

S”1(C(m))  =  VCm)]  *  (17) 

m  =  1  ,  .  .  .  ,  6  . 


In  turn,  define  random  variables  V,  .  and  space  X,  .  arbi- 

( m)  ( m ) 

trary  such  that 


(surjective)  V(m)  :  (♦B(Cm.1l,  V  Cm  >  ]  -  X(m) 

Then  define  space  X  by 
^  6 


,  m  =  1 , 


.  6  . 


X  =  U  X 
m=l 


U>  ' 


(18) 


and  random  variable 


V  :  a  -  X 


by:  for  each  u  s  n  ,  there  is  a  unique  m(w)  , 


is  m  ( w )  <  6 
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such  that  w  «  <VCm(u)-l>'  (Cm(u>)] 

whence 


V  =  V  f  u ) 

(w  )  (m(u> )  ) 


(19) 


Next,  define  mapping  a  : 

A  :  x  -  »(X) 

(9>(X)  being  the  collection  of  all  ordinary  subsets  of  X)  ,  by,  for 

any  Cm  «  a. 


Cm  3  V(S"1(C  (rng(S)))) 

m 1 

m 


-  v(IO*VCm>l>  "  “  X(j>  '  “  " 


,  6  . 


(20) 


Then  a  is  an  injective  mapping  and  it  follows  that  for  any 
outcome 


/x 


S  = 


(21) 


P(V  .  c„>  -  PIC.  ..  SI  -  VV  ■ 


(22) 


for  all  C  e  a  . 
m 

In  particular,  we  may  let  each  X(m)  be  minimally  any 
(disjoint)  singleton: 


X(m)  "  {y(m)  }  <  >"  =  1  <  •  •  •  -6 


Altermatively ,  let  each 


X,  .  =  C(m) 
(m) 


implying 


V  =  S  ;  X  *  rng(S)  =  {C{ 1 1 . 


(23) 


(24) 


(25) 


In  any  case,  we  have  obtained  both  ( nonuniquely )  a  random  sub¬ 
set  S  of  a  ,  representing  interactions  of  the  classes,  and  a 
random  variable  V  ,  as  above,  such  that  the  evaluation  of  V  over 
certain  compound  events  from  its  elementary  event  space  and  the  one 
point  coverage  function  of  S  all  coincide  with  <J>B  ,  analogousto 

the  first  situation.  (See  [86]  for  further  details  and  the  general 
set  of  constructions.)  Thus,  again,  the  function  *_  obtained  from 

D 

the  panel  of  experts  is  a  possibility  function  and  not  a  probability 
function  in  general:  it  need  not  sum  to  one  -  as  in  the  first  situ¬ 
ation  -  due  to  the  presence  of  overlapping  events  (shown  by  the 
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random  variable  V  over  X  ) . 


C.  Inference  rules  and  Matching  tables 

Inference  rules  connect  possible  combinations  of  matchings  of 
attributes  (to  various  degrees)  of  any  two  given  track  histories  to 
the  correlation  levels  of  the  histories.  We  denote  inference  rules 

as  R  ,  t  =  1 , 2 ,  .  .  .  ,  r  ,  where  the  tr  inference  rule  R^_  is  of 

the  form  "if  -  or  iff  -  (antecedent)  attribute  A^  at  k  ~  matches 

for  any  two  target  histories,  for  all  k  e  ,  then  (consequent) 

the  two  target  histories  are  considered  to  represent  the  same  target 
(or  not  to  represent  the  same  target)  to  degree  t ^  "  ,  where 


0  *  Jt  S  (1,2, ... ,m>  , 


0  5  at,k  '  *t  5  1 


(26) 

(27) 


for  all  k  e  ,  t  =  l,...,r  . 

k  represents  the  degree  of  matching  relative  to  attribute  A^ 

for  any  two  target  histories,  and  may  be  originally  in  linguistic 
form  such  as  high,  medium,  low,  etc.,  or  in  numerical  form.  Simi¬ 
larly  0  3  t  £  1  represents  the  degree  of  data  association  (or 

lack  of  it),  k  e  ,  t  =  l,...,r  . 

In  symbolic  form,  analagous  to  the  results  obtained  earlier, 
the  truth  of  rule  Rt  is  evaluated  for  any  possible  set  of  values 

(or  "dummy"  data) 


Zj  =  (Z  (1) 
Jt  Jt 


Z,(2) ) 
Jt 


(28) 


where 


k€ 


.( j) 


e  dom(Ak)  , 


(29) 


for  j  =  1,2,  arbitrary  fixed,  and  prior  target  association 

possibility  or  probability  function 


Q  =  f(0 


;  30) 


evaluated  at  any  possible  association  level  9  e  [0,1]  (suitably 
discretized,  if  required): 


N (Rt(Q|Aj  )n 


♦  ,(Mj  (Z  U).Z  (2) 
Jt  Jt  Jt 


(1  -  Q) 


(31) 


where 
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:  dom(  Ak)xdom{Ak)  -♦  [0,1] 

is  a  possibility  function  (actually  a  fuzzy  set  -  binary  relation) 
representing  the  matching  table  corresponding  to  attribute  A^  , 

k  €  ;  t  =  l,...,r  :  (More  on  construction  of  the  M^'s  later.) 

Analagous  to  the  modeling  of  the  error  tables  ,  inference 

rules  Rt  may  be  obtained  from  analytic  considerations  or  subjec¬ 
tively,  by  polling  a  panel  of  experts.  (The  latter  procedure  has 
been  well  developed  in  a  number  of  related  artificial  intelligence  - 
expert  systems,  but  usually  within  the  framework  of  classical  logic 
and  probability  theory.  (See  e.g.,  [24]  or  [236].) 

Consider  the  case  of  "analytic"  inference  rules.  These  are 
based  -  and  essentially  equivalent  to,  or  extensions  of  -  classical 
hypotheses  tests: 

Let  r  k  o  be  an  unknown  paremeter  and  for  each  possible  r  , 

(y(j,|r)  ,  j  =  i , 2  ,  be  two  random  variables  (possibly  vector  valued 

and  often  assumed  statistically  independent)  representing  observa- 


520 


Goodman  and  Nguyen 


tions  of  data  for  some  attribute  A1  ,  say,  given  r  ,  where  it  is 

assumed  the  relevant  probability  functions  or  probability  density 

functions  are  all  known.  Let  a  =  A(Z)  ,  where  Z  =  (Z^^.Z^2^)  , 

be  a  test  statistic,  with  scalar  function  A  £  0  having,  for  each 
fixed  value  t  ,  known  prob.  dist.  function  Fr  which  is  assumed 

increasing  over  [0,b]  ,  b  -  a  pos .  const.,  possibly  infinite,  and 

such  that,  pointwise,  Ff  is  increasing  in  r  .  Then  A  may  be 

chosen  based  on  invariance,  uniformly  most  powerful  test  properties 
or  other  considerations.  (See,  e.g.,  [154].) 

Then  a  typical  test  for 

H  :  0  £  t  £  t 
o  o 

VS.  (37) 

H .  :  t  £  t  £  b 

1  o 

is  given  by,  at  any  fixed  significance  level  a,  0  <  a  <  l  : 

For  aii/  observed  Z  : 


Decide  H  iff  A (Z)  £  D 
o  '  '  a 

i*f  1  -  Fr  (A(Z))  *  1  -  P  (Dq)  s  a  ; 
o  o 

Decide  Hj  iff  A(Z)  >  Dfl 

iff  1  -  Fr  (A ( Z ) )  <  1  -  Ft  (D  )  s  a 
o  o 

where  thus 

Pr(Decide  |  Hq  true)  £  a  .  (39) 

Thus  if  data  asso  !ation  for  the  two  target  histories  is  to  be 
based  solely  on  this  s t  of  hypotheses,  i.e.. 


Decide  Q  =  1  iff  Hq  is  accepted  ,  (40) 

Decide  Q  =  0  iff  H1  is  accepted  ,  . 

then  the  entire  procedure  is  mathematically  equivalent  to  the 
inference  rule  evaluation 

n  (R(ct)  ( Q  |  Z )  )u  =  <{>  (M.(a)(Z)  ,  Q°  =  1)  (41) 

■1  CO  1 

where  the  matching  table  here 


is 


An  Application  of  General  Logical  Decision  Tbeory 


521 


M. 


(Z) 


1  iff  1  -  F  (A (Z) )  *  a  , 

O 

0  iff  1  -  Fr  (A (Z) )  <  a  , 
o 


(42) 


for  any  Z  e  dom(A1)  x  domfA^)  . 

Clearly,  a  "softer"  more  general  form  is  obtained  by  replacing 
by  the  natural  full  non-truncated  matching  table  M1  where 
for  all  such  Z  as  above, 

M  (Z)  =  1  -  Fr  (A (Z) )  ,  (43) 

o 

the  randomized  significance  level,  and  replacing  exponent  0  by, 
say,  0  i  £  1  ,  so  that  becomes 


yl 

II  {  R2  {  Q  I  Z)  )  II  =  ^(MjfZ)  ,Q  A)  . 

An  example  of  the  above  is  A1  =  geolocation,  with  r.v. 

distributed  statistically  independent  Gauss 2(p^^,z^^)  ,  j 

such  that 


(44) 

(Z|r  ) 
1,2  , 


A (Z)  =  (Z 


(1) 


.(2) 


)T(X(1) 


(1) 


-  Z 


(2) 


with 


d  .  (1) 
=  (A' 


-  #,<2>)?(*<1> 


+  z 


(2 


'  )‘(P 


(1) 


-^(2)) 


(45) 


(46) 


where  tq  is  some  upper  bound  chosen  for  maximal  separation  dis¬ 
tance  (or  t-  =  0  could  be  chosen)  ,  and  hence  (A(Z)|r)  is 

distributed  noncentral  chi-square  with  two  degrees  of  freedom  and 
noncentrality  parameter  r  ,  etc. 

If  several  attributes,  say,  A1,...,An  each  has  an  associated 

statistical  test  with  test  statistics  A  ^  and  well- 

behaved  cumulative  prob.  dist.  functions  Fj  ,  j  =  l,...,n  ,  with 

Z(i) ' * • ' ,Z(nj  statistically  independent,  then  these  may  be 

combined  into  one  compound  test  for  compound  attribute  (A1~***-An) 

by,  e.g.,  the  construction  of  the  minimum  of  individual  randomized 
significance  levels 

K(Z)  =  min  (1  -  Fj(A  (Z( ) ) )  ,  (47) 


For  any  Z  : 
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Accept  HQ  iff  all  1  -  Fj(Aj(Z(j)))  *  a  « 
j  =  1 ,  .  .  .  ,n 

iff  K(Z)  2  a  ;  (48) 

Accept  H1  iff  K(Z)  <  a  , 

where  we  note  the  level  of  the  test  is 

Pr ( Decide  H1|Hq  true)  £  1  -  (1  -  a)n  .  (49) 

(For  background  or  more  general  constructions  of  compound  hypothese 
tests,  see,  e.g.,  [154]  and  [18].) 

Thus,  if  data  association  were  based  only  on  this  compound 
test,  one  would  have  equivalently 

HR^a)  (Q I  Z)  n  =  <$><=>(M(a)  (Z)  ,Q°  =  1)  (50) 


where  compound  matching  table  is 


Mc(a)(Z)  = 


1  iff  K(Z)  S:  a 
n  iff  K(Z)  <  a 


However,  as  in  the  modeling  of  the  error  tables,  in  general,  not  all 
of  the  relationships  between  data  association  and  attribute  matching 
is  obtainable  as  above;  a  panel  of  experts  may  be  utilized  to  derive 
further  relationships  -  especially  those  involving  non-statistically 
oriented  attributes  such  as  flag  color,  class! f ication ,  visual 
sightings,  etc. 

Again,  as  before,  the  inference  rule  corresponding  to  the  com¬ 
pound  test  described  above  in  fully  general  form  is 

II  (Rc  (  Q  |  Z  )  )  II  =  tc=,(Mc(Z)  ,Q7  )  (52) 


where 


MC(Z)  e  K(Z) 


0  5  i  S  1 


In  general  -  especially  for  those  attributes  involved  in  sta¬ 
tistical  hypotheses  tests  -  matching  tables  are  distinct  in  form 
from  error  tables.  However,  in  the  case  of  many  subjective  ori¬ 
ented  attributes,  matching  tables  may  be  reasonably  assumed  the 
same . 

See  Table  8  below  for  an  illustration  of  a  typical  inference 


rule . 
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Natural  Language  Form  From  Combination  of  Panel  of  Experts  and 
Standard  Statistical  Hypotheses  Testing: 

R  =  "If  two  track  histories  are  close  to  each  other  and  have 

somewhat  similar  fundamental  frequencies,  but  differ 
considerably  with  respect  to  motion  type, 
then  there  is  some  -  but  not  much  -  chance  that  they 
correlate . " 


Formal  Language  Form: 

For  any  2  and  0  , 

Truth  {if  modif ierhi^h( ( ' 


in  Mx)  &  ( z[2) 


in  M  ))  & 


m0difie3W<V 

modifierlow((z'1) 


in  M5)  &  ( Zg  ‘ 
in  Mg )  &  (Z<.2) 


in  M5))  & 
in  Mg)) 


then  modifier  ,  (0  IN  Q ) ) 

very  low'  x ; 


3.  Full  Truth  Evaluation-Possibility  Function  Form; 
for  any  Z  and  0  , 

<p  (anLeced(Z)  ,  conseq(Q(e  )  )  )  , 

anteced  (  Z  )  =  <p&  ( Mj  {  z[  1 1  ,  Z  [  2  ]  )  ^  ,  Mg  (  Z* 1 5  ,  Z  J2  ]  )  ^  ,  Mg  (  Z^1 }  ,  Z^2  1 
conseq ( Q ( 0 ) )  =  Q ( 0 )  . 


>*•>  . 


(i)  The  constants  are  to  be  chosen  such  that 

0  <  G  <  Eg  <  E2  <  E1  , 

(ii)  Logical  System  1  =  (  <J>  ,  <f>&,  <pQr)  =  (1  -  (•).  prod ,  sum ) 

chosen, 

(iii)  M1 ( Z^ 1 ^ , Z^ 2 ^ )  =  1  -  cumul .  distrib.  funct.  of  chi-square 

random  variable  with  2  degrees  of  freedom  evaluated  at 

W  =  {z[1]-z[2)  )?(ri+Z2)-1.  {z[X)~z[2)  )  ,  zrz2  error 
covariance  matrices. 

(iv)  Mg  is  similarly  evaluated  from  a  closed-form  formula  while 
M.  is  obtained  from  a  numerical  table. 

D 


Table  8 

An  Example  of  Symbolization  of  an  Inference  Rule. 
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Additional  comments  on  error  tables,  matching  tables,  and 
inference  rules. 


Error  tables  and  matching  tables 


An  elaborate  example  of  an  analytically  obtained  error  table 
for  an  ideal  classification  attribute  has  already  been  presented 
(Tables  4-8,  etc.)-  The  actual  error  table  for  this  attribute 
would  then  be  a  6  by  6  table  of  numbers  lying  between  0  and  1  , 
being  evaluated  correspondingly.  (See  eqs .  (4),  (5)). 

Another  example  may  be  furnished  by  the  attribute 
A3  =  length,  with  dom(A3)  =  (B1 , B2 , B3 , B4 }  , 

where 


B1  =  very  short 
B2  =  short 

B  =  moderate 

B4  =  long 


approx,  less  than  50  ft. 

approx,  between 
50  and  100  ft. 

approx,  between 
100  and  200  ft. 

appprox.  over  200  ft. 


(54) 


Thus,  P3  consists  simply  of  a  4  by  4  table  furnished  by  the 
consensus  of  a  panel  of  experts  in  response  to  the  question: 

p(true  length  =  Bm  )  observed  length  *  Bk)  ?  (55) 

for  1  <  k,m  s  6  ,  p  being  probability  or  possibility. 


Table  9  presents  plausible  values  for  the  above  equation  (55): 


Table  9 

Error  Table  for  Eq .  (55). 


Thus,  Table  9  shows 

P3(Z3j)  =  B2  I  S3j)  =  V  = 


0.8  , 


(56) 
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regardless  of  j  =  1  or  2  . 

It  is  not  unreasonable  here  to  allow  the  matching  and  error 
tables  for  to  coincide,  following  suitable  symmetrization , 

i. e.,  M  =  P_  .  Similar  remarks  hold  for  the  classif ication  example 

O  O 

previously  presented  (for  the  second  situation,  not  the  first). 

Consider  now  some  examples  of  analytically  obtained  error  and 
matching  tables.  These  could  typically  involve  attributes  such  as 
geolocation,  frequency  or  class  (as  under  the  first  situation  in  the 
previous  example) . 

Thus,  for  A1  =  geolocation,  we  may  have,  regardless  of 

j. J  =  1.2, 

P1(2j;i)  |  2<J))  =  dirac(Z  -  Z  )  •  Gauss^  ( S ,  1 )  (57) 

where  zj^  and  are  arbitrary  with 

zjj)  3  (X.z)  ;  zjj)  =  (&,!)  .  (58) 

X,&  «  a  s  ir2  ;  *  suitably  discretized,  z  ,z  e  *  =  class  of  all  2 

by  2  positive  definite  matrices,  and  hence 

2(j),|U)  e  dom(A1)  =  a  x  »  , 

and  Gaussj  is  a  probability  function  over  a  formed  by  suitably 
discretizing  the  bivariate  distribution  Gauss2  . 

However,  as  noted  previously,  the  associated  matching  table  M1 
is  quite  different  from  P^  here: 

M1(z}1),z}2))  =  1  -  Fr  (A (Z*1’ ,zj2) ) )  ,  (59) 

o 

for  all  Z  =  (Z^1^ ,Z^2^ )  €  domfA^)  x  dom(A1)  ,  with  A  given  in  eq. 
(45) )  . 

Another  example  of  an  analytically  derived  error  table  is  that 
for  A2  =  signal  frequency: 

From  simplified  physical  considerations,  the  relationship 
between  true  source  frequency  and  received  frequency  is  given  by  the 
doppler  shift  equation  [255]  given,  regardless  of  j , j  =  1,2,  by 

JU)  =  (i  _  (VT/VS)-  coss)-zJj) 

-  observed  frequency  at-rest  receive  (60) 

VT  =  target  speed,  (61) 

Vs  =  speed  of  sound  in  water 
(assumed  constant). 


(62) 
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z<j)  a 
L2 


true  frequency  at  source. 


(63) 


8  =  angle  between  receiver  (64) 

position,  target  location  and  target  motion 
velocity  measured  counterclockwise. 

(See  Figure  1) 


Figure  1 . 


Basic  geometry  of  droppler  shift. 


Suppose  then  e  is  distributed  uniformly  over  [0,2*]  and  VT 
is  distributed  uniformly  over  [a,b]  for  some  suitable  chosen  range 
0  a  f  b  ,  with  ®-vT  >  and  all  statistically  independent. 

Then  it  follows  by  use  of  standard  transformation  of  probability 
techniques  that  we  can  derive  the  conditional  probability  density 
function 


p„(z'j) =  (!/(*• (b-a) ) ) • W' 


fg  (W/a, W/b) ,  if  w  <  0 
o 

g  (W/b, W/a) ,  if  W  >  0  , 
O 


where  for  all  u,v  «  [0,1]  , 


2  2 

,  .  d  , ,  ,  1  +  1-u  1  -  4  1-v 

g  ( u ,  v)  =  ( 1  /2  )  log  - -  •  - -  - . - 

°  2  2 

1  -  1-ti  1  +  1-v 


W  —  Vg*  (  /Z^]  )  -  1)  , 


w*  3  vs-  (2<j)/(Z^j)  )2) 


Again,  by  suitable  discretization  over  dom(A2)  s  R+  ,  the 
corresponding  error  table  P2  can  be  constructed  from  p2  in  eq . 
(65)  . 

On  the  other  hand,  the  matching  table  M2  here  may  be  treated 

simply  by  first  assuming  Zj1'  ,  Z^2^  are  statistically  independent 
Gaussian.  (See  also  eqs .  (37)  -  (46).) 
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Inference  rules 

Eqs .  (37)-{53)  adequately  cover  inference  rules  analytically 

obtained  through  statistical  tests  of  hypotheses.  However,  it 
should  be  pointed  out  again,  that  the  relationship  between  the 
rejecting  or,  accepting  of  the  hypothesis  concerning  a  single 
attribute  or,  more  generally,  a  collection  of  attributes  (in  the 
case  of  a  combined  test  of  hypotheses),  and  the  final  decision  to 
"correlate"  -  i.e.,  decide  the  target  histories  represent  the  same 
target,  is,  in  general,  a  subjective  choice  of  how  much  weight  or 
reliability  the  hypotheses  test  has  with  respect  to  the  final 
decision.  For  example,  two  different  targets  could  truly  possess 
the  same  or  similar  locations,  frequencies  and  classes  and  yet  not 
be  the  same  object.  Thus,  all  factors  contributing  to  this  iden¬ 
tification  should  be  accounted  for,  and  since  in  practice  this  is 
not  always  feasible,  this  may  be  incorporated  into  the  inference 

rules,  by  softening  the  consequents,  i.e.,  replacing  Q°  a  1  by 

7  i 

Q  as  in  eq.  (44)  or  eq .  (52)  . 

On  the  other  hand,  inference  rules  may  be  formulated  completely 
subjectively,  by  querying  a  panel  of  experts  in  the  field.  Two 
typical  examples  of  this  include: 

(a)  R  =  "If  A„  0.6  -  matches  and  Ae  0.8  -  matches, 

1  J  O 

then  0.7  -  level  intensification  holds  for  Q."  (69) 

Here,  Jj  =  (3,8)  ,  and  we  may  symbolize  ,  for  any 


> 


(70) 


Zg1'.  Z<2)  €  dom ( Ag )  ,  Zg^.Zg2'  e  dom(Ag)  ,  and  for  any 

Q  €  [0,1]  : 


Rj(Q  I  Zj  ) 

-  ■WI'V2”1'2!21"1 


5,  (Mgfz'11  , Zg2  5  ) )  ) ,  Q2'3) 


(71) 


(b)  r2  =  A1  does  not  match  that  well  but  A 2  highly 

matches,  then  correlation  probably  does  not  occur".  (72) 

Here,  J 2  =  (1,2)  ,  and  we  may  symbolize,  using  the  techniques 
illustrated  previously  as,  e.g.: 
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where  <t>low  ,  <t>high.  =  [0.1]  -  [0,1]]  may  be  modeled  as  ex¬ 

ponents  as  before  or  otherwise,  as  in  the  previous  examples. 

E.  Combination  of  evidence  for  the  correlation  problem. 

The  basic  structure  of  the  technique  for  combining  of  evidence 
may  be  summarized  as  follows: 

(a)  Observed  data:  2  categorized  by  attributes  and  target 
history . 

(b)  Error  tables  ,  k  =  l,...,m  ,  which  can  be  used  to 

connect  £  with  any  Z  . 

(c)  Matching  tables  ,  (with  Pfc  =  or  Pfc  *  M^) 

k  =  l,...,m  ,  leading  to: 

(d)  Inference  rules  R,.  ,  t  =  l,...,r  which  can  be  used  to 

V. 

connect  any  Z  and  Q  s  P(s)  . 

Now,  let 

R  =  {Rt  |  t  =  1,2 . r}  , 

P  =  (Pk  |  k  =  1 . m}  . 

Then  from  the  property  of  conditional  possibility  functions  - 
see  also  the  more  general  results  of  section  8.3  -  the  posterior 
possibility  function  for  «  is  given  by,  omitting  the  subscripts 
projx(CQ)  ,  C^,  k  =  1,2,...,  CQ  ,  etc.  for  simplicity,  prior  to 

diagonalization  and  normalization, 

<M«  I  1  :  R.P)  =  <]>or  <f>(8,Z  |  |  ;  R ,  P )  ,  (74) 


where 

d  m 

G  =  x  (dom(A.  )  x  dom(A  )  )  , 

k-1  K  K 

and  general  fuzzy  set  system  ( <f>not ' ' ^or *  is  cnosen' 
where 


<*>(«.  Z  |  |  ;  R,P)  h  <|)&(<?){e  |  Z,|  ;  R.P)  ,  ( Z  |  |  ;  R,P)) 

—  I  2  :  R)  ,  <p(Z  |  2  :  P)) 


assuming  that  sufficiency  relations  for  the  inference  rule  and  error 
table  effects  hold: 
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$(0  |  Z, |  ;  R,P)  2  <M  £  !  Z  ;  R)  , 
<t>(Z|l;R,P>  =  <t>(Z||;  P)  . 


Consider  now  the  error  table  effects.  This  expression  does  not 
really  represent  combination  of  evidence,  but  rather  the  joint  error 
function.  Hence,  a  natural  evaluation  is 


i(Z  |  |  ;  P)  =  (J»  {Pfc(Z.  \  K))  ■ 


k'  k 
,  m 


where 


pk(Zk  1  V 

■  vpk'zk"  '  vzl2>  i2k2l>»  ■ 


for  all  Z^^ ,2^  e  domCA^)  ,  j  =  1,2,  ;  k  =  l,...,ra  . 

On  the  other  hand,  the  inference  rule  effect  does  represent 
combination  of  evidence. 

The  first  approach  follows  the  idea  of  the  extension  principle, 
a  specific  case  of  which  is  related  to  the  fuzzy  set  extension  of 
the  Weak  Law  of  Large  Numbers  (see  section  8.4): 

Approach  1 . 

Let  gr  :  [0,l]r  -*  [0,1]  be  a  given  function  such  as 


(a)  gr(yl . V  5  *Wrj‘yj 


0  <  w  .  <  1  ,  2  w  .  =  1  , 

r3  1-1  rJ 


(b)  gr(Yr- 

. . ,y  )  =  8 

r'  y1 , 

.  .  .  , y  -yi 

[  0  iff 

3  yi  *  yj 

= 

l  s  i,j  i 

[  yi  lff 

*< 

»-* 

II 

II 

»1 

where  gr  represents  some  combination  of  descriptions  of 
each  rule.  Then  by  the  "extension  principle" 
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*<£  I  l  R>  =  *gr(Ri . R)«Z'Q) 

relative  to 

Q(ei)  =  q(e  x ) 

• 

Q(«r)  -  q(er) 


'  or 

(fl! . V  1 

€  g'1!0) 

.  ( discretized 
In  the  case  of  (b)  , 


Lt=l , . . . , r 


(»Rr(q(«t) |Zj  ) )n ) 


4>(«  I  Z  ;  R)  = 


4>&  (»Rt(q(0t)  I  Zj  )  II  ) 

(t=l _ _  r ) 


(33) 


(84) 


For  simplicity,  from  now  on,  we  drop  the  notation  ll(R.)ll  for 
the  simpler  . 

Approacn  2 :  (See  [80],  [82],  [83].) 

Let  :  [0,l]r  [0,1]  be  a  nondecreasing  combining  func¬ 
tion,  such  as  gf  =  or  gr  =  wtdsum  as  in  (a)  of  Approach  1. 

Then  there  is  a  unique  fuzzy  subset  A  of  Gq  x  [0,1]  such  that 
for  all  ax . ar  e  [0,1], 


cfA1[gr(«1 . arM]  2  n  (Rt  1[aj,l]) 


(85) 


minimally  with  respect  to  set  ordering.  Indeed, 


*A  =  9r°<Rl . Rr>  * 


(86) 


If  gr  =  (  ,...,)  is  a  strict  Archimedean  t-norm,  then  there 


is  an  h  :  [0,1]  -*  R  with  h  decreasing,  h ( 1  )  =  0  ,  such  that 

-1  r 

9r(Y1 . Yr)  =  h  (  z  h(yt)  )  (87) 

(See  [ 140]  .  ) 

Then 


wr<a1 . ar>  ■  (gro<Ri- — Rr>>  1[gr(ai . ar )  ■  1  ^ 


-1  r  -lr 

( V  |  h  (  2  h(R  (V)  )  )  2:  h  Z  h  (a  )  )  ) 

t=l  t=l 
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=  {V  |  Z  h(Rt(V) ) 
t  =  l 


S  Z  h(a  )  }  . 

t=l 


(88) 


In  particular,  for  a  ^  =  • • =a  ^=a 


w r(a,  .  .  .  ,a) 


2  h(R  (V))  <  h (a ) } 
t=l 


(89) 


Note  -  as  seen  in  previous  examples  -  when  fuzzy  set  systems 

=  (1  -  ( • ) , prod , bndsum)  is  chosen  and  all  possibility  functions 

involved  are  also  probability  functions,  then  the  above  equations 

all  reduce  to  the  classical  Bayesian  forms. 

See  Figure  2  for  a  flow  chart  summary  of  the  combination  of 

evidence  technique  used  here.  This  presents  only  a  simplified 

scheme  for  combining  evidence.  Based  on  the  results  in  section  8.3, 

(see  Remarks  1,2)  and  in  8.4  (Theorems  3  and  4),  in  general,  it  may 

be  better  to  use  a  hybrid  logical  system  in  evaluating  the  posterior 

function  for  e  .  Thus,  if  A, , . . . ,A  ,  denote  statistically  based 

1  m 

attributes,  while  A  A  „  denote  subjective  based  ones,  it 

m  +1  m 

may  be  appropriate  to  choose  a  <p&  1  to  combine  the  first  group  for 

error  table  effect  and  some  2  for  the  second  group,  in  place  of 

a  single  .  Moreover,  i  '  2  maY  weH  be  chosen  as 

Archimedean  t-norms.  On  the  other  hand,  since  inference  rules 
typically  operate  on  combinations  of  common  or  similar  attributes  - 
varying  basically  only  on  intensification  modifications  for  possible 
attribute  matches  -  <$&  =  min  may  be  most  appropriate  for  obtaining 

the  overall  inference  rule  effect.  Also,  in  obtaining  the  marginal 
posterior  function  for  e  ,  it  follows  from  the  above  quoted  results 

of  sections  8.3  ,  8.4  that  the  single  projection  operator  <J>or  may 

be  replaced  by  max  over  Z"  =  ( Z. )  „  and  by  some 

Archimedean  t-norm  over 

Z 1  =  (Zk)k=1  m,  ,  where  as  usual,  Zk  =  (Z^^,  z£^  )  ,  for  track 

histories  i  and  j  ;  k  =  l,...,m"  .  In  addition, 

<t>  =  max  seems  most  appropriate,  according  to  the  above  reasoning, 

for  obtaining  normalization  as  in  section  8.3.  See  also  the  com¬ 
ments  in  Goodman  where  several  principles  or  guidelines  are 

developed  for  choosing  types  of  t-norms  and  t-conorms.  Unfortunate¬ 
ly,  the  choice  of  the  most  appropriate  logical  operators,  except  for 
the  above  general  guidelines,  still  involves  a  mixture  of  art  and 
intuition. 

As  an  application  of  the  above  principles,  the  combination  of 
evidence  technique  (based  on  eqs .  (74)-(80),  (84),  etc.)  becomes: 


^diag (  8  1  l'5'5>  =  ,  4  (  ♦  (  e  I  d^g:  |.R.P).  ^Univ (  diag  5  ]  »  (90) 

solving  for  <p{9  |  diag;  S,R,P)  ,  0  5  8  $  1  , 

l.jjdlag)  =  max  (0  |  2,R,P)  ,  (91) 

0S8S1  ~ 
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* . .  (e 
Tdiag 

|  2,R,P)  =  4>(fl  1  2,R,P),  -a  \ 

(®k  9  )k=1 . m- 

(92) 

$(0  |  2,R,P)  =  max  [<J>  (<f>{9,2  |  t,R,P) 

~  ~  ~  or  ~  ~  ^  ~  ~ 

1  ~  [an  Z' 

(93) 

4>{0,z  | 

f,R,P)  =  ! 

(94) 

<t>G(e  .z) 

=  4>&  2(p"(Z"  1  |"),  R(.  1  Z))  , 

(95) 

R(9 ,Z) 

=  min  (Rt(*t  1  Z))  ' 

t=l , 2 ..... r 

(96) 

P‘  (Z1  1  1’  ) 

■  pkl2i2,i2k2,'>> 

k=l ,2, ... ,m‘ 

(97) 

P" (Z" ||" ) 

■  ♦S.3<»S,3<Pk«Zill'5,!;II>'Pk<Zk2,l2i21>» 

k=m ' +1 , . . . ,m" 

(98) 

where  Z  =  (Z1 ,Z" 

')  «  1  =  etc-<  and  ^&,2'  ^&,3’ 

♦a. 

are  Archimedean  t-norms  and  <t>  is  an  Archimedean  t-conorm  such  as 

Tor 

chosen,  e.g.,  conveniently  from  Frank's  family  -  see  section  2.3.6, 
For  other  approaches  to  the  combination  of  uncertainties,  see 
[23],  [24],  [236],  the  critique  of  [236]  as  presented  in  [120]  as 

well  as  the  discussions  in  Chapter  8. 


Prescreening:  Pick  Pair 
of  Track 
Histories 


Library 


Store :  P , M , R 


Inputs 

I 

7 

Q(e  ) 

_ 1 

Compute:  V=Set  of  integers 
representing  attributes 

appearing  in  Z 

Select  from 
library: 

P={Pk|k  in  V) 

R=  { R^  |  V^_  subset  of  V} 

Compute  for  all  possible  Z  ,  Error  Table  effect: 

<MZ|Z.P)  =  4>&  (Pk(zk|zk)) 

(k  in  V) 


Compute  for  all  possible  Z  and  e  ,  Inference  Rule  effect: 
4>(e  |Z,R)  =  (Rt(Q(et)  |Z)  ) 


over  all  t 
such  that 

subset  of  V 


Compute  joint  posterior  function  for  all  Z  and  e 
^(e,Z|Z,H,P)  =  e&(<|)(e  |Z.R)  ,<t)(Z||,P)  ) 


Compute  posterior  function  for  all  e 


i(e  |  Z  , R , P )  = 


Tor 

(over  all  Z) 


(<p(e  ,  Z  |  Z  ,  R ,  P )  ) 


Normal  1 ze 

/ - ^ - 5 - 

'Output:  Tabulation  of  $  (  9  |  Z  ,  R  ,  P  )  vs.  e 


Figure  2 .  Flow  chart  structure  for  combination  of  evidence 
algorithm  applied  to  correlation  problem. 
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In  Figure  2.  the  following  legend  holds: 

2  is  observed  data,  Z  is  variable  dummy  data, 


<P1 

'P2 ' ‘ - • 

) 

is 

set 

of 

all 

error  tables , 

<R1 

,R2 ' • ’ ’ 

) 

is 

set 

of 

all 

inference  rules 

,m2,  .  .  . 

> 

is 

set 

of 

all 

matching  tables 

Q(e)  is  prior  possibility  function  of  e  , 

9  is  correlation  level  between  two  track  histories, 
5  =  ($  , <{>& , <}>  )  is  logical  system  chosen. 
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The  computation  of  $(0|f  :  R<P)  .  etc.  has  been  established  as 

an  algorithm  in  PACT  -  Possibilistic  Approach  to  Correlation  and 
Tracking  [80],  [82],  [86].  This  involves  two  looping  processes: 

computations  of  Z  and  Q  =  q{0)  -  As  an  approximation,  whenever 

an  attribute  Afc  is  such  that  either  a  difficulty  arises  in  the 

obtaining  of  P^  or  domfA^)  ,  although  discrete,  is  too  large,  or 

simply  to  reduce  computations  of  looping  through  Z  ,  one  has  the 
option  of  assuming  formally  that  Pk  represents  errorless  mea¬ 

surements,  i.e.,  Pk  =  I  .  In  this  case,  this  is  equivalent  in  the 
computations  for  4>(0  |  Z;  R,P)  for  any  Rt(Q  I  Z)  with  k  e 

(i.e.,  involving  A^)  to,  in  effect,  replace  Mi^Zk^  in 

antecedent  of  Rt  by  Mk(^k>  •  See  a9ain  Figure  2  for  an  outline 

of  the  PACT  algorithm. 

The  PACT  algorithm  may  be  considered  an  example  of  a  nonse¬ 
quential  inference  rule  expert  system  approach.  More  dynamic 
inference  rule  systems  are  described  in  [24],  [236]  -  but  with 
non-fuzzy  set  approaches,  where  inference  rules  are  "fired"  (or 

activated)  or  not,  depending  on  whether  the  antecedents  are  satis¬ 
fied  or  not.  See  also  the  surveys  in  [183]  and  [260].  Again,  we 
ask:  How  to  choose  the  particular  fuzzy  set  system  (<t'not>lt)&-<l)or)  • 

Recall  that  these  operations  may  be  considered  infinite-valued 
(i.e.,  [0, l]-valued)  extensions  of  classical  logical  operations. 

These  extensions  are  highly  nonunique.  Two  guidelines  may  be  used 
in  choosing  a  particular  fuzzy  set  system:  theoretical  consider¬ 

ations  via  isomorphic-like  relations  between,  e.g.,  semidistri¬ 
butive  systems  and  random  set  operations  (Chapter  6)  and  empirical 
considerations  ([250],  [291],  and  Chapters  8  and  2.3.5  .  See,  also 

[92]  for  further  discussion.)  In  dealing  with  human  informants, 
care  must  be  exercised  in  extracting  the  maximal  amount  of  unbiased 
information,  especially  in  the  modeling  of  inference  rules.  This 
requires  improvement  of  psychometric  techniques  in  modeling  such  as 
outlined  in  [195']  or  [21]  (especially,  the  first  chapter).  See 
also  [220]  and  [63]  for  systematic  approaches  to  modeling  of  member¬ 
ship  functions  from  a  subjective  viewpoint.) 

An  alternative  to  obtaining  inference  rules  completely  empir¬ 
ically  is  to  use  an  inductive  procedure  on  selected  samples  of  input 
attribute  matching  intensities  and  corresponding  observed  consequent 
(i.e.,  correlation  level)  matching  intensities  analogous  to  that 
found  in  [177]  or  [208]. 

It  is  often  desirable  to  be  able  to  interpret  the  output  of  the 
combination  of  evidence  -  the  tabulation  of  the  (normalized) 

posterior  function  <$>(0  |  diag  ;  ^,R,P)  vs.  0  ,  for  all  0  e  [0,1]  . 

If  the  function  is  monotone  increasing,  it  follows  that  it  may  be 
formally  interpreted  as  a  probability  distribution  function  over 
[0,1]  .  On  the  other  hand,  in  general,  this  will  not  be  so,  and  a 

Lebesgue-Stielt jes  signed  measure  interpretation  may  be  most  appro¬ 
priate.  (See  section  2.3.9.)  Correspondingly,  a  natural  single 
f igure-of-merit  appears  to  be  the  Lebesgue-Stielt jes  total  variation 
mean 
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fj i  =  J1  e  |  d  <J>(0  I  diag  ;  |,R,P) |  .  (99) 

9=0 

Another  f igure-of-mer it  is  based  upon  the  conditional  random  varia¬ 
ble  (W  |  Sy(BQ))  ,  where  Bq  is  the  fuzzy  set  represented  by  the 

above  posterior  function,  and  Sy  is  the  one  point  equivalent 

nested  random  set  (see  Chapter  5)  equivalent  to  Bq  ,  i.e.,  letting 

<t>c  =  <?»(•  |  diag  ;  §,R,P)  , 

Su(B0)  = 

o 

where  U  is  a  random  variable  uniformly  distributed  over  [0,1]  . 

For  any  outcome  Su(Bo) ,  (W  |  SU(BQ!)  is  uniformly  distributed  over 

Sy(B0)  ■  Then  the  desired  f igure-of-merit  here  is 

^  E ( W  |  Su(Bo))  (two-stage  mean)  =  E(W)  .  (100) 


When  <J>  is  monotone  increasing  (and  hence  <t>g  (1)  =  1)  , 

o  o 

then  the  above  finally  reduces  to 

=  (1/2  )IJ1  =  ( 1/2 )  ( 1  -  (  $B  (S)d9)  .  (103) 

9=0  ° 

Other  f igures-of-merit  may  be  constructed.  An  open  question 
connected  with  this  problem  is  the  determination  of  such  a  value 
which  is  invariant  with  respect  to  any  random  set  one-point-cover- 
age-equivalent  to 

o 


Some  as'v 


atotic  results. 


Asymptotic  results  were  previously  obtained  for  the  general 
combination  of  evidence  problem  in  section  8.4.  Here,  more  speci¬ 
fic  results  are  obtained  for  the  correlation  problem.  Consider  two 
fixed  track  histories  i  =  1  ,  j  =  2  . 
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Without  loss  of  generality,  suppose  are  statis" 

tical  attributes,  where  for  each  Ak  ,  the  discretization  result 


P  (Z 
k,  p  k 


*i*’>  * 


Z{t 

k 


*£*’»■ 


(104) 


/  »  \  o  1 1  \  mk  o  ( t  ) 

holds,  for  all  Z£  ' .  Z£  '  €  dom(Ak)  s  R  ,  where  ffc(-  |  Z£  )  is 

( t )  mk 

a  bounded  continuous  p.d.f.,  <  =  1,2  and  A^  (Z^  )  s  R  is  a 

sufficiently  small  m^-cell,  so  that  P^  {■  |  2^4 ^ )  is  a  probability 

function  ;  k  =  .  Here  p  indicates  the  pth  discretiza¬ 
tion  state  and  it  is  assumed  that  cells  p^Zk*  ^  ^  forming  the 

mk 

discretization  D,  of  R  refine  each  other  to-  ward  0  as 

k,  p 

p  -»  +  «>  Since  for  these  statistical  attributes,  although  in 

general  *  Pk  ,  has  a  form  derived  from  Pfc  or  at  least  the 

domain  for  Pk  (based  on  statistical  hypotheses  testing  using 

randomized  significance  levels  -  see  eqs.  (37)— (53) ) ,  M^.  is  also 

affected,  through  its  domain  replacing  Dk  ,  etc.  Thus,  if 

affected,  replaces  ,  k  =  l,...,m'  . 

Next,  suppose  Am • +  1 - • • • * Am • •  are  subjective  attributes  so 

that  error  and  matching  tables  coincide: 


pk.P«zi'1  '  *  "k.pX’X’1  ■  ‘  ■  l-2  ■ 


(105) 


for  all  «  dom(Ak)  ,  k  =  ra’+l,...,m"  ,  where 


here 


indexes  the 


th 


time  of  knowledge  concerning  Pfc  (and  thus  M^) 


The  contribution  of  the  pth  discretization  M.  for  M.  ,  for 

x ,  p  x 

all  k  ,  k  =  l,...,m"  ,  in  turn,  when  substituted  into  any 

inference  rule  R  ,  yields  the  corresponding  R,.  , 

t  *•  /  P 

t  =  1  ,  2  ,  .  .  ,  ,  r  . 

Suppose  that  the  posterior  function,  adding  subscript  p  ,  is 
indicated  by 


,(8)  =  <t>  (8 


diag  ;  |,R,P)  ;  8  e  [0,1]  , 


(106) 


where  equations  (90)— (98)  hold  (with  subscript 
Suppose  also  j  .  <P&2  and  $or  ’ 

satisfy  differentiability  conditions  (c),  (e), 

Corollary  1,  section  8.4. 

Then  the  following  result  obtains: 


p  properly  added), 
with  generator  h  , 

(g),  respectively,  of 
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Theorem  (See  also  Goodman  £91*  *  '  3 • ) 

Under  the  above  assumptions, 

♦diag,*-1*  I  i-5-S>  3  ♦dlag.p1”  '  l'5'£> 

=  1  -  h-1 ( min  (C (0 ,|) ,h(0) ) )  , 

e(fl;S)  ^  max  (r(8,Z";2))  , 

(all  Z") 


t(«.Z";| )  =  r  •  E  (v(<$.G  (0,Z;|"))  |  f')  , 

O  O 

(Z •  i Z *  ) 


(107) 
(  108) 


(109) 


where  the  statistical  expectation  E  ( •  )  is  with  respect  to 

(Z  ■  || •  ) 

Z'  ,  considered  now  formally  as  a  random  variable  with  p.d.f. 
f ' ( •  |  5‘)  ,  given  by 


f(2*  |  2')  3  ?r  (f-(z<4)  I  Si0))  .  (110) 

lSkSm1  , 

£=1,2 

I  i  1  mi r 

for  all  Z'  =  (Z.  )  ,  zl  1  e  R  ,  lSkim*,£  =  1,2, 

lSkSm ' 

where  constant  rg  for  q  =  2m’  is  given  in  (a),  Corollary  1, 

section  8.4  and  function  v  is  given  in  the  conclusion  of  the 
Corollary; 

<PG  (s.Z;|")  3  lim  <f>Q  (e,Z;|")  (111) 

o  ~  p-*+« 

and  as  in  eq.  (95) 


♦c. P(e = 


^.2‘P(p)<2" 


2"),  R(p)(9  I  2)) 


(112) 


for  all  Z  =  (Z',Z")  and  all  9  e  [0,1]  ;  p  >  1  . 


Proof :  Immediate  from  Corollary  1,  section  8.4. 

A  simple  example. 


=  1 


,  1  ,  2 


3 


Suppose  m' 


=  prod  , 
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<t>  =  probsum  ,  lim  P'!  (Z"  |  Z")  =  8  ,  and  r  =  1  with 

or  (p)  ~  ~  Z",l" 


R  (e  |  Z)  =  R1{e  |  Z‘) 


t’Qr ( 1  “  M^zJ^.zj25)  1  ,  e  X) 


=  1  -  M1(zJ1),Z1{2))  1  •  ( 1  -  e  1 ) 


(113) 


for  some  fixed  a  ,  >  0  ;  all  e  e  [0,1] 


Then  r2m,  =1  , 


y(x)  =  x  ,  all  x  e  [0,1]  .  Suppose  also,  f^zj*^  i  2^*^)  is 


the 


p.d. f .  at  Z 


<<  ) 


corresponding  to  the  Gaussian-Dirac  delta 


-1  a  ( z(t)  5{t]) 
2  A1(Z1  ,Z1  > 


(2*) 


I 


det  Z 


(<) 


(114) 


a1(z<<).2<*))  3  (YJO  -  *{«>,*.!<«>  *.  ( y 

zj‘>  3  (Y}«)tI(t))  , 


%(*■ )  d  o(i )  o(<  ) 
1  “  lYl  '1 


(<) 


y<‘>) 


(115) 

(116) 
(117) 


where  Y**  *  .yj*  *  e  r  1 


(i)  oil)  nl 'nl 

,  Zj  «  R  positive  definite,  so 


that  without  loss  of  generality  m1  =  n1  +  n^  ,  l  =  1,2  .  Suppose, 
finally , 

subsection  (C)  of  this  chapter.); 


Mj  arises  from  the  standard  test  statistic  (see 


M1(Z11) ,Z[2) ]  =  1 


yn1<A2,ZSII-ZlJ>" 


(118) 


where  F 
random  variable  and 


is  the  probability  distribution  function  of  a  >< 
nl  nl 


A  j  (  Z  ’  ) 


=  a2(zJx) ,zj2) ) 

=  (y|1,-y|2) ) (zj1)+rj2) )_1(Y<1)-Y<2) ) 


(119) 


Z(1),Z(2) 
1  '1 


for  all 
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Then  it  follows  that 


<p, .  __(«  |  2,R,P)  =  1  -  e 

Tdiag,+«  '  ~  ~  ~ 


-< (e ;2- ) 


(120) 


where 


t(e ;Z‘ )  =  i  -  (i  -  e  1: 


(d  "  F_  (X 


Xn1(A2(|* ) 


nl  ni ' A2 (~  '  ) 


))  ), 


(121) 


a  monotone  increasing  function  in  0  ,  so  that 
*univ;+~(diag)  =  1  ~  e_1 


(122) 


and  hence  for  all  0  e  [0,1] 

o  _! 

in  !  diag  ;  Z,R,P)  =  (1  -  e  )  /  (1  -  e  ) 

+a> 


(123) 


If  n.  is  even,  a  relatively  simple  form  exists  for  F  .  In 
l  1 

_  v  /  o 

particular,  for  n1  =  2  ,  F2(x)  =  l-  e  ,  x  i.  0  ,  and  hence 


t  (8 ; Z ' )  =  1  -  (1  -  a  *)  •  U(ai;Z')  , 


(124) 


where 


w(a1(|')  3 


_!  2 

e  ~  2  '  X  2  ,  A  2  (  Z  •  ) 


X2,A2(|> )  ' 


ai  +  1 


e  2 


r«,-Vi)  . 


(125) 


by  straightforward  manipulation  of  a  noncentral  chi-square  p.d.f.  as 
a  Poisson  mixture  of  central  ones.  Note  that 

4> ; m  ( 6  |  diag;2,R,P)  is  monotone  increasing  in  0  ,  up  to  1  at  0 

=  1  ,  and  for  any  fixed  0  ,  ,  i  ,  is  monotone  increasing  in 

the  weighted  squared  "distance"  a  ^  ( 2  ■  )  compatible  with  $+oo 

viewed  essentially  as  a  formal  probability  distribution  function 
(except  for  $^(0)  >0  in  general).  Finally,  note  that  as 

dj  -  0  , 


4*^(9  I  diag  ;  Z  ,  R ,  P )  - 


1 


)  /  (1  -  e  1)  , 


(1 


CHAPTER  10 


SUMMARY  OF  THE  MAIM  RESULTS  AHD  OPKH  QUEST I OHS 


10.1  Summary 

In  this  section,  a  brief  description  of  the  main  results  in 
previous  chapters  is  presented  together  with  concluding  remarks. 

In  the  Preface,  several  technical  and  mathematical  definitions 
concerning  uncertainty  modeling  were  discussed.  Including:  systems, 
states,  processes,  attributes,  system  uncertainties,  experiments, 
dispersions,  generalized  sets,  general  logical  systems,  truth  func¬ 
tionality,  multi-valued  logic,  negations,  t-norms  or  conjunction 
operators,  t-conorms  or  disjunction  operators  and  copulas. 

Chapter  1  provides  a  basic  introduction  to  the  subject  matter. 
Section  1.1  develops  motivation  and  a  basic  framework  of  analysis 
for  problems  Involving  linguistic  information.  The  use  of  observed 
data  (numerical  or  linguistic),  error  distributions  or  dispersions, 
and  inference  rules  (by  experts,  if  possible)  is  emphasized.  The 
general  combination  of  evidence  problem  is  also  introduced  as  an 
Important  objective  of  knowledge-based  systems.  The  problem  of 
Internal  vs.  external  modeling  and  the  development  of  general  logi¬ 
cal  systems  Is  also  discussed,  with  some  emphasis  upon  the  major 
problem  of  resolving  ambiguities  that  arise  in  modeling  linguistic 
information. 

In  section  1.2,  a  presentation  is  made  of  the  modeling  and  use 
of  natural  language  information  for  multiple-valued  logical  set 
theories.  In  addition,  some  basic  connections  with  random  set 
theory  are  also  mentioned.  Finally,  section  1.3  presents  a  brief 
outline  of  the  contents  of  the  text. 

Chapter  2,  symbolization  and  evaluation  of  language,  consti¬ 
tutes  a  large  section  of  the  text  -  necessarily  so  because  of  the 
foundations  it  lays  for  investigating  uncertainty  modeling.  Section 

2.1  is  succinctly  summarized  in  Figure  I,  where  the  flow  of  know¬ 
ledge  from  conception  to  evaluation  Is  symbolized.  The  pertinant 
aspects  of  this  transition  include:  the  real  world  as  a  problem 
generator,  systems  of  Interest,  cognition,  which  through  communi¬ 
cations  gives  rise  to  natural  language  expressions,  which  in  turn 
through  appropriate  parsing  techniques,  lead  to  formal  language 
definitions,  which  In  turn  through  semantic  evaluations,  lead  to 
general  logical  systems.  Finally,  combination  of  evidence  pro¬ 
blems  -  arising  from  real  world  generations  -  may  be  treated  via 
knowledge-based  systems,  consisting  of  a  general  logical  system  and 
a  feedback  loop  to  both  a  panel  of  experts  in  terms  of  their  ideas 
and  thoughts  and  to  the  real-world  effect  of  the  tentative  problem 
solution. 

Section  2.2.2  presents  the  basic  structure  of  a  formal  lan¬ 
guage.  A  summary  of  this  highly  detailed  section  is  given  in  part  V 
of  the  section.  A  formal  language  t  contains  a  category  Core(t) 
of  types  and  sorts  (kinds  of  things) ,  basically  consisting  of  formal 
objects,  arrows  and  relations,  together  with  function  symbols 
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(constituting  Ar{Core  (/.)))  representing; Cat (4 )  ,  category-function 
concepts',  Foun(4)  ,  set  theory  concepts  such  as  exponentiation,  mem¬ 
bership,  equivalence,  set  abstraction;  Loc(4)  ,  logical  connectors, 
including  negations,  conjunctions,  disjunctions  and  implications; 
and  Quan(4)  ,  quantifiers,  including  universal,  existential  and 
possibly  various  partial  quantifications.  In  addition,  4  contains 
ThSyn(£)  »  a  theory  of  syntax  establishing  the  relations  between  the 

function&lj  symbols  acting  upon  the  deduction  category  \?ar(4)  of  all 

basic  variables  of  4  .  Thus,  9ar(i)  consists  of  objects  which 
will  later  be  interpreted  as  generalized  or  fuzzy  sets  or  universes 
of  discourse  under  semantic  evaluations,  arrows,  which  play  the  role 
of  functions  among  the  objects,  and  relations.  Next,  4  contains 
wfex(i)  ,  the  class  of  well-formed  (according  to  Th  „_(/.))  expres- 

S  yll 

sions  of  4  -  essentially  compound  relations  or  well-formed  formulas 
Wf f (4 )  and  compound  terms  Wft{4)  .  Also,  4  contains  the  total, 
free,  and  bound  individual  variable  maps  TV (4)  ,  F V(4)  ,  BnV(4)  , 
where  (well-formed)  individual  variables  are  the  same  as  projection 
arrows  denoted  as  a  class,  W fv(/. )  ,  a  subclass  of  Wft(4)  .  Those 
maps  count  the  free  individual  variables  vs  the  bound  ones  in  any 
given  expression.  Finally,  4  contains  some  theory  ThK(4)  ,  in 

addition  to  the  basic  syntax.  As  presented  in  parts  of  the  text  - 
but  not  necessarily  so  everywhere  -  ThR{4)  is  chosen  as  an  entirej 

or  fragment  of  an  t  intuitionistic  theory,  encompassing  logical, 
quantitative  and  set  theoretic  aspects.  Basic  theorems  or  deducts, 
elements  of  De(Thk(4))  ,  are  given  in  subsection  (VI). 

Section  2.3.2  concerns  semantic  evaluations  n  n  of  a  formal 
language  4  within  a  deduction  category  (c,a)  .  Basically,  we 
restrict  ourselves  to  the  case  where  li  n  :  4  -»  (C,a)  is  a  model 
for  ThSyn^  and  Possible  also  for  Th(4)  ,  if  (C,»)  has 

additional  structure,  such  as  that  of  a  topos.  In  these  cases,  the 
semantic  evaluations,  i.e.,  the  ranges  of  n  11  in  (c,a)  are  com¬ 
patible  with  the  structure  of  4  ,  i.e.,  are  Isomorphic-alike. 

In  section  2.3.3,  semantic  evaluations  are  specialized  further 
to  Loc(4)  u  Quan(4)  ,  each  class  of  evaluations,  determining  a  par¬ 
ticular  logic  L  .  A  number  of  basic  definitions  is  given  here 
concerning  the  properties  of  semantic  evaluations  n  il  ,  theories 
Th(4)  ,  logics  L  ,  and  related  concepts  may  possess.  These  include: 
logical  implication,  validity,  satisfiability,  tautology,  contradic¬ 
tion,  tautologic  equivalence,  truth  functionality,  truth  functional 
completeness,  consistency.  Inconsistency,  deductive  soundness  and 
completeness,  logical  soundness  and  completeness,  model  consistency, 
axiomatizabillty  and  compactness.  Subsection  2.3.3(b)  establishes 
the  very  important  result  —  Theorem  1  (rephrased,  due  originally 
to  Coste ) : 


(a)  \?ar(4)  is  a  natural  deduction  category  (indeed  the  canonical 

one)  for  evaluating  t  with  model  il  ll^arjzj  :  4  -*  Var(4)  . 


(b)  Any  model  li  il  :  4  -*  (C,a)  (where  (c  ,a)  may  have  additional 
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structure  up  to  a  topos)  may  be  ctecomposed  into  a  composition  of  a 
map  from  Var (i)  to  (c,a)  with  11  u^arj£) 

(c)  Consequently  Thv{£)  (with  Th  (£))  is  a  sound  and  complete 

iv  syn 

theory.  By  identifying  any  category  c  ,  or  more  generally  any 
category  with  additional  properties  up  to  a  topos,  with  the  deduc¬ 
tion  category  (c,Sub)  ,  Theorem  1  can  be  reformulated  for  ordinary 
categories  or  topoi,  etc.,  (Theorem  2).  2.3.3(c)  consists  of  a 

brief  survey  of  various  particular  logics,  including  Lukasiewicz  of 
all  orders  (including  classical  logic),  Bochvarian,  GcJdel- 
Intuitionistic  (compatible  with  Th  (£)  here),  all  truth-functional 

IV 

logics,  and  the  non-truth  functional  Probability  Logic. 

Set  theory  concepts  are  discussed  in  section  2.3.4,  beginning 
with  a  selected  collection  of  set  theories,  including  Na’ive  Set 
Theory,  Zermelo-Praenkel ,  Maydole-Hay,  Intuitionistic ,  Novak's  and 
Weidner's  theories.  Some  emphasis  at  the  end  of  2.3.4  is  placed 
upon  the  Comprehension/Abstraction  Principle  in  forming  generalized 
sets  corresponding  to  wff's,  and,in  particular,  in  defining  general 
set  operations.  Maydole's  results  concerning  the  consistency  of 
Lukasiewicz  ^-logic  and  Maydole-Hay  set  theory  -  with  a  modified 

form  of  the  Comprehension  Principle  included  as  an  axiom  -  are 
pointed  out  as  a  positive  step  in  the  establishment  of  a  global 
generalized  set  theory.  In  any  case,  in  2.3.5,  general  logical 
systems,  are  Introduced,  in  which  it  is  hoped  that  in  some  "logical 
sense",  at  least  consistency  may  hold.  These  logical  systens  are 
treated  from  a  general  deduction  category  approach.  Connections 
with  knowledge-based  systems  are  discussed.  A  model 

ii  ii  :  t  -»  (GOG, Sub),  where  GOG  is  a  union  of  Goguen’s  category  of 
fuzzy  sets  >  Gog(H)  ,  with  appended  logical  operators,  is  chosen  as 
the  formal  definition  of  a  general  logical  system.  In  practice, 
GOG  could  be  replaced  simply  by  a  single  category  Gog(H)  and  a 
single  collection  of  logical  operators.  In  conjunction  with  general 
logical  systems,  the  class  of  generalized  or  fuzzy  subsets  of  a 
space,  and,  correspondingly  the  class  of  dispersions  on  a  space,  are 

introduced.  Essentially  each  object  variable  i  In  ?ar(t)  ,  is 
evaluated  as  Mill  =  (Xj,,91)  «  Ob(Gog(H))  ,  i.e.,  0 ±  :  Xd  H  ,  H  a 

fixed  Heyting  algebra  (truth  space),  X^  an  ordinary  set,  with 

i  =  A.  Identified  as  the  generalized  set  and  e ,  =  $  as  its 

i 

dispersion  or  membership  function  relative  to  Xi  .  The  evaluations 

under  n  ii  of  logical  connectors  “nt*  ,  ,  *or"  in  Loc(r)  are 

denoted  until  =  $nt  ,  ii&ii  *  <|>&  ,  Horn  =  4>Qr  ,  if  the  logic  is  truth 

functional.  Various  set  operations  (for  generalized  sets  here)  are 
defined,  including  products,  unions,  intersections  complements, 
relating  to  the  results  in  2.3.4.  In  addition  set  relations,  set 
abstractions  of  wffs  ,  and  power  class  and  subset  relations  among 

generalized  sets  are  evaluated.  Thus  the  wff  =  (x^  y^n  ^  )  , 

where  x^1*  ,  y^fi  *  are  In  Wfv(i)  and  («  :  0b(£)  -♦  Rel  (/.))  Is 
the  distinguished  function  symbol  representing  generalized  set 

membership  relation,  i  an  object  in  Var(i)  ,  can  be  evaluated  as 
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=  <x  (1>  «.  (v{q1)  u  z(Q  5)>  (formal) 
in  1  ni  in 


where 


:(1>  e  z<ft  >>  , 


f.  =  <x'“'  €  z'  '>  ,  etc, 
2  m  m 


Then 


iif ii  =  iif1li  Horn  iifjii 

-  *or  (»*,■ 


,(i) 


<4>  4  ('lx'  'll )  .  <p  j  (Hx'^'ll)) 


(i). 


<P 


'or,,v(*V  "n 

ym 

(llx<1)ll)  , 

(fl1),,  (A1) 
y '  'Uz'  ' 
m 


(Q1)’ 


with  similar  results  holding  for  intersection,  complementation  and 
other  generalized  set  operations.  Hence  a  calculus  of  generalized 
set  operations  and  relations  can  be  established. 

Furthermore,  wffs  need  not  be  restricted  to  only  membership 
relations  (again  see  2.2.2),  but  because  of  the  Comprehension/ 
Abstraction  Principle,  can  w.l.o.g.  be  considered  to  b  in  that 
form. 

Section  2.3.6  presents  an  extensive  survey  of  various  families 
of  operators  ,  representing  (through  semantic  evaluations) 

negation,  <f>.  ,  representing  conjunction,  and  *  representing 

disjunction.  <(>&  is  usually  called  a  t-norm  and  <|»or  a  t-conorm. 

A  number  of  important  properties  are  presented  for  various  sub¬ 
classes  of  such  operators,  including  Archimedean,  DeMorgan, 
distributive,  semi-distributive,  copulas  and  co-copulas,  Frank's 
family,  Yager's  family  and  others.  Sklar's  Theorem,  canonical  forms 
and  ordinal  sums,  as  well,  are  discussed. 

The  problem  of  extending  ordinary  functions  whose  domains  are 
ordinary  sets  to  those  whose  domains  are  generalized  sets  is  dis¬ 
cussed  in  2.3.7.  Basically,  two  approaches  are  considered:  If 
f  :  X  -*  Y  is  an  ordinary  function,  A  is  a  generalized  subset  of 
X  ,  A  «  ?(X)  ,  J (X)  £  U(X)  £  f(X)  ,  then  two  approaches  to  extending 
f  to  f_  :  *(U(X))  -*  »(D(Y))  are  considered  for  any  C  €  U(X)  , 

A  e  ? (U(X) )  , 


Approach 

(I)  : 

(A) 

(C)  ■  II  <C  €  ?(A)>II 

(II)  : 

-1 

Approach 

(A) 

(C)  =  II  < f  (C)  €  A> II 

(the  second  approach  being  compatible  with  uncertainty  mea¬ 
sures).  Applications  to  ihe  extension  of  classical  logical 
(Boolean)  connectors  are  also  presented.  Additionally,  semantic 
evaluations,  including  compound  operators,  intensifications, 
conditioning,  projections  and  Interactions,  Bayes'  theorem. 
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quantifications,  and  transformations  of  dispersions  and  linguistic 
variables  are  all  considered  in  section  2.3.8.  Especially  new  is 
the  approach  to  conditioning  (and  hence  q"=r'tif ication) ,  which 
generalizes  Zadeh's  fuzzy  cardinality  approach  in  a  natural  way. 

Section  2.3.9  (A)  treats  in  some  detail  Probability  Logic  (PL) 
(briefly  considered  in  2.3.3(C)).  Included  is  an  analysis  of 
Hailperin's  approach  to  PL  and  a  discussion  of  his  extension  of 
FrSchet’s  bounds  on  joint  probabilities.  Section  2.3.9  (B)  extends 
PL  to  the  more  general  Lebesgue-Stielt jes  Logic  (LSL) ,  where  essen¬ 
tially,  probability  measures  are  replaced  by  differences  of  such, 
corresponding  to  their  distribution  functions  being  replaced  by  the 
less  restrictive  class  of  functions  of  bounded  variation  with  ranges 
in  the  unit  interval.  An  extensive  appendix  on  functions  of  bounded 

variation  in  is  added  at  the  end  of  2.3.9  (B)  for  convenience 
of  reference. 

Section  2.4  presents  a  category  and  topos  theory  viewpoint  of 
generalized  set  theory.  The  basic  categories  Gog(H)  (proposed  by 
Goguen)  and  Puz(H)  (proposed  by  Eytan) ,  forming  pseudotopoi ,  and 
extending  Zadeh's  fuzzy  set  theory,  are  discussed  and  compared.  In 
addition, the  intuitionistic  modification  Fuzx(H)  of  Fuz(H)  and 

Higg's  topos  Higg(H)  are  also  analyzed.  Relations  between  those 
categories  are  estabished  and  the  controversy  over  which  candidate 
category,  or  suitable  modification,  is  most  appropriate  for  pre¬ 
senting  a  theory  of  generalized  sets  is  detailed.  Tie-ins  with 
models  of  Zermelo-Fraenkel  set  theory  are  also  briefly  considered  in 
section  2.4.2  (G)  Three  appendices  are  given  at  the  end  of  2.4.2 
covering  basic  category  and  topos  definitions,  deduction  categories 
and  formal  topos  properties,  Including  a  natural  realization  of 
ThK(£)  within  a  deduction  category  or  topos,  and  the  B&nabou  and 

other  constructions  of  standardized  deduction  categories  and  formal 
topoi.  The  latter  extend  In  a  natural  sense  arbitrary  deduction 
categories  and  formal  topoi  and  provide  a  general  basis  for  es¬ 
tablishing  generalized  or  fuzzy  sets,  GOG  being  a  particular  case. 

Section  2.5  briefly  presents  Zadeh's  approximate  reasoning  and 
fuzzy  logic  as  a  legitimate  logic,  with  an  associated  deduction 
system  compatible  with  the  semantic  evaluations  present. 

Finally,  Chapter  2  concludes  with  section  2.6,  where  a  number 
of  different  natural  language  sentences  are  translated  into  formal 
language,  and  in  turn,  semantically  evaluated. 

In  Chapter  3,  a  basis  is  established  for  investigating  uncer¬ 
tainty  measures  -  i.e.,  dispersions  whose  domains  of  definitions  are 
collections  of  subsets  of  given  spaces.  A  brief  survey  of  basic 
information  theory  concepts  Is  presented  in  3.1.  Choquet-type 
measures,  including  plausibility,  belief  and  doubt  measures  -  all 
generalizing  ordinary  probability  measures  are  introduced  In  3.2. 
Following  the  establishment  of  preliminary  relations,  characteri¬ 
zations  are  obtained  for  doubt,  belief  and  plausibility  measures  in 
terms  of  alternating  signed  sums  (Theorem  1).  Further  characteri¬ 
zations  relating  to  coverage  (subset  and  superset)  and  incidence 
functions  of  random  subsets  of  finite  (or  complementary-finite) 
spaces  are  presented  in  Theorems  2-4. 

Section  3.3  considers  possibility  and  related  uncertainty 
measures  and,  to  some  extent,  conditional  possibility  measures.  In 
addition,  basic  relations  between  various  classes  of  possibility  and 
other  uncertainty  measures  are  shown. 

Chapter  4  provides  an  introduction  to  random  set  theory.  The 
natural  topologies,  o-algebras  and  coverage  and  incidence  functions 
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associated  with  {non-finite,  in  general)  random  sets  are  presented 
in  section  4.2.  (An  appendix  treating  this  from  a  projection  view¬ 
point  is  given  at  the  back  of  4.2.)  Theorem  1  establishes  the 
construction  of  random  sets  from  primitive  ones.  Theorem  2  re¬ 
phrases  Dempster's  result  justifying  the  alternative  names  "upper" 
and  "lower"  probability  measures  for  plausibility  and  belief  mea¬ 
sures,  respectively ,  in  terms  of  non-precisely  known  (up  to  set 
values)  transformations  of  probabilities.  Theorem  3  establishes  for 
general  spaces,  doubt  measures  as  coverage  functions  of  certain 
regular  random  sets  (or  subsets)  and  plausibility  measures  as  inci¬ 
dence  functions  for  certain  regular  random  sets.  Similar  results 
are  also  established  for  belief  and  disbelief  measures.  Theorem  4 
presents  the  natural  imbedding  of  a  random  variable  as  a  random  set , 
compatible  with  the  results  of  the  previous  theorems.  The  converse 
of  Theorem  3  is  presented  in  Theorem  5:  Given  any  random  subset  of 
a  space,  its  subset  coverage  function  is  a  doubt  measure,  its  super¬ 
set  coverage  function  is  a  belief  measure,  its  incidence  function  is 
a  plausibility  measure,  etc.  Theorem  6  treats  approximations  to 
arbitrary  functional  transforms  of  random  sets.  Joint  random  sets 
and  set  operations  on  random  sets  are  also  considered.  General 
results  involving  Choquet's  theorem  on  capacities  are  presented  in 
4.3.  Multi-valued  mappings  and  Bochner  integrals  relative  to  random 
sets  are  considered  briefly  in  section  4.4. 

Chapter  5  treats  random  sets  and  dispersions.  In  5.1,  zero- 
one  stochastic  processes  are  analyzed  and  related  to  random  sets 
(Theorem  1)  and  to  doubt  measures  (Theorem  2)  and,  in  particular,  to 
semi-distributive  t-norms  and  t-conorms  (Corollary  1).  The  one- 
point  coverage  problem  is  treated  in  5.2,  where  the  basic  nested 
canonical  random  set  Sy(A)  and  the  maximal  entropy  random  set 

T(A)  are  considered,  among  others.  Flou  classes,  level  sets  and 
nested  random  sets  are  connected  in  Theorems  3  and  4,  extending 
previous  work  in  [184].  Additional  properties  are  obtained  in  sub¬ 
section,  5.2  (C)  Involving  isomorphic-like  relations,  based  on  one 
point  random  set  coverages  between  operators  on  generalized  sets  and 
corresponding  ones  on  random  sets.  Some  emphasis  is  given  to  the 
Sy-type  of  random  sets.  (See  also  Chapter  6.)  A  general  solution 

to  the  one  point  random  set  coverage  problem  for  the  finite  case  is 
presented  (with  some  geometrical  flavor)  in  Theorem  5.  One  point 
coverage  functions  generated  by  random  intevals  are  considered  in 
subsection  5.2  (E),  culminating  in  Theorems  6  and  7.  Theorem  8 
connects  random  variables,  generalized  sets  and  random  sets.  This 
results  in  the  relationship,  for  any  generalized  subset  A  of  space 
X  and  x  e  X  , 

II  (x  c  A )  I!  =  Poss(x  «  A) 

=  Pr ( x  €  S ( A) ) 

»  Pr { V( A )  «  x)  , 

where  S(A)  is  some  appropriate  random  subset  of  X  and  V(A)  is 
some  random  variable,  where  x  is  considered  a  compound  set. 

Theorem  9  of  section  5.3  treats  the  multiple  point  random  set 
coverage  problem  for  finite  spaces.  Also,  in  section  5.3  (Proposi¬ 
tions  1  and  2)  relations  are  established  between  Zadeh's  possibility 
measure  and  random  sets  by  a  direct  use  of  Matheron's  techniques 
(171).  Higg's  category  (topos)  and  multiple  point  coverage  func¬ 
tions  are  also  considered  to  some  extent.  In  5.4,  the  problem  of 
determining  extremal  -  entropy  random  sets,  one  point  coverage  - 
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equivalent  to  given  generalized  sets  is  presented.  Tie-ins  with 
Robbins'  earlier  results  concerning  expected  volumes  or  lengths  of 
random  sets  are  also  pointed  out.  The  fundamental  maximal  entropy 
theorem  for  discrete  spaces  is  reviewed.  Theorem  10  presents  the 
(otherwise,  unconstrained)  maximal  entropy  solution  to  the  one  point 
coverage  problem,  while  Theorem  11  considers  the  problem  with  an 
expected  cardinality  constraint.  Theorem  12  treats  the  minimal 
entropy  problem  for  one  point  coverages  by  random  sets,  followed  by 
some  simple  examples  and  counterexamples.  In  section  5.5,  inspired 
by  the  independent  work  of  Hooper  ([118]),  randomized  tests  of  clas¬ 
sical  statistical  hypotheses,  confidence  region  procedures  and  the 
one  point  coverage  problems  in  section  5.4  are  related. 

Chapter  6  concerns  general  problems  arising  in  the  isomorphic¬ 
like  (or  "weak"  homomorphic)  relations  between  operators  on  random 
sets  and  those  on  generalized  sets.  Following  an  extensive  develop¬ 
ment  of  mappings  relating  generalized  sets  to  one  point  coverage- 
equivalent  random  sets,  called  choice  function  families.  Theorem  1 
presents  conditions  when  arbitrary  combinations  of  cartesian  sum  and 
product  (or  union  and  intersection,  essentially,  equivalently)  oper¬ 
ations  have  isomorphic-like  counterparts  in  both  general  set  systems 
and  random  set  ones.  Theorem  2  determines  which  general  set  opera¬ 
tions  are  weak  homomorphic  to  given  set  operations  on  random  sets. 
Specialization  to  compositional  ordinary  set  operations  is  also 
carried  out  in  Corollary  1. 

The  converse  problem  is  also  discussed:  Given  any  generalized 
set  operations,  which  ordinary  set  operations  on  random  sets  are 
weak  homomorphic?  Some  examples  are  shown  where  generalized  set 
operations  admit  no  natural  corresponding  (weak  homomorphic)  random 
set  operations.  Theorem  3  addresses  the  problem  of  constructing 
weak  homomorphisms  between  generalized  and  random  set  operations 
which  depend  as  little  as  possible  on  a  particular  choice  function 
family.  Theorem  4  represents  weak  homomorphic  representations  of 
fuzzy  partitionings,  with  an  application  to  weighted  sum  operators 
(Corollary) . 

Chapter  7  presents  theories  of  uncertainty.  Some  connections 
between  various  logical  (and  other)  systems  Involving  information/ 
knowledge  flow  Is  given  In  Figure  1  in  section  7.1.  Manes'  very 
encompassing  "fuzzy  theories"  Is  treated  in  some  detail  in  7.2  and 
related  to  general  logical  systems  and  functional  extensions  or 
lifting  operators.  Watanabe's  fuzzy  set  theory  is  analyzed  in  7.3. 
Gaines'  uncertainty  logic,  an  attempt  at  simultaneously  generalizing 
both  probability  logic  and  Lukasiewicz  logic,  identified  with  fuzzy 
sets,  is  presented  and  critiqued  in  7.4.  Finally,  in  section  7.5, 
Schefe's  "agreement  probability"  approach  is  reviewed.  (This 
approach  mimics  to  a  certain  degree  ordinary  probabilities.)  In 
addition  some  of  the  controversies  Involving  fuzzy  set  modeling  are 
discussed . 

In  Chapter  8,  inference  procedures  are  developed  for  use  in 
knowledge-based  systems.  Knowledge  representation  in  general  is 
discussed  briefly  in  8.1.  Section  8.2  presents  the  outline  of  a 
general  decision  theory  based  on  dispersions,  replacing  classical 
probabilistic  notions.  In  section  8.3,  a  brief  description  is  given 
of  established  approaches  to  the  combination  of  evidence  problem. 
This  includes  the  following:  classical  Bayes'  theorem,  belief  and 
related  (Dempster-Shaf er )  uncertainty  measures,  and  certainty 
factors.  An  alternative  theory,  applying  the  spirit  of  section  8.2, 
is  presented  in  a  general  setting.  Two  short  results  (Theorems  1 
and  2)  are  used  as  guidelines  In  the  development  of  the  approach,  an 
application  of  which  is  given  in  Chapter  9.  Section  8.4  presents 
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some  asymptotic  results  connected  with  generalized  decision  theory. 
The  first  result  (8.4  (1))  is  an  analogue  of  the  classical  Law  of 
Large  Numbers,  involving  t-norm  and  t-conorm  operators  in  place  of 
the  (classical)  product  and  sum  operations.  The  next  result  (8.4 
(2)),  in  effect,  is  a  continuous  extension  of  the  first  to  an  un¬ 
countable  number  of  arguments  for  the  t-conorm  operator.  In  8.4(3), 
a  few  miscellaneous  approaches  are  presented  for  estimating  unknown 
dispersions.  (See  also  8.5  and  Chapter  9.)  In  8.4  (4)  additional 
asymptotic  properties  of  t-conorm,  t-norm  forms  arising  when  "fuzzy" 
data  is  averaged  are  obtained  (Theorems  1,  2).  Asymptotic  proper¬ 
ties  for  combination  of  evidence  forms,  when  information  granularity 
approaches  zero  is  given  in  Theorem  4  and  Corollary  1,  Theorem  3 
being  a  "guideline"  theorem  in  the  sense  of  Theorems  1  and  2  of  8.3. 
More  specifically,  under  relatively  mild  analytic  conditions  and 
appropriate  segregation  of  data  and  inference  rules  into  "analytic" 
and  "subjective"  groups,  the  limiting  posterior  dispersion  of  the 
unknown  parameter  vector  is  a  computable  transform  of  an  integral  - 
which  can  be  interpreted  as  a  classical  probabilistic  expectation. 

Chapter  9  consists  of  an  application  of  the  approach  developed 
in  Chapter  8  for  general  knowledge-based  systems  to  the  specific 
problem  of  data  association  in  multiple  target  tracking.  The  gen¬ 
eral  problem  and  background  information  are  given  in  part  (A).  The 
concept  of  error  tables,  as  conditional  dispersions,  is  presented  in 
(B)  with  tie-ins  to  one  point  coverages  of  random  sets  and  random 
variables  (see  also  Chapter  5).  In  part  (C) ,  inference  rules  based 
on  matching  tables  are  analyzed  and  related,  where  appropriate,  to 
classical  statistical  tests  of  hypotheses  and  decision  procedures. 
In  part  (D)  further  problems  involving  modeling  of  error  tables  and 
inference  rules  are  presented.  In  part  (E),  the  actual  combination 
of  evidence  is  carried  out,  in  the  form  of  a  posterior  dispersion 
describing  the  unknown  parameter  9  -  in  this  case  the  true  level 
of  association  for  any  given  pair  of  track  histories.  Interpreta¬ 
tions  and  further  uses  of  this  output  are  also  presented,  including 
approaches  to  obtaining  a  final  f igure-of-merit  description  of  e  , 
representing  in  a  general  sense  the  posterior  expectation  of  e  . 
Finally,  in  part  (F)  ,  Corollary  1  of  section  8.4  (4)  is  applied  to 
the  situation  considered  here  to  obtain  asymptotic  forms  for  the 
posterior  dispersions  of  9  . 

Chapter  10  concludes  the  text  with  a  subsection  (10.1)  devoted 
to  summaring  the  basic  results  of  the  text  (as  presented  above) .  In 
10.2  (A),  a  survey  of  controversies  and  criticisms  is  presented, 
detailing  particularly  10.2  (A) (II),  Watanabe's  criticism  of  Zadeh 
and  his  alternative  theory.  In  10.2  (A) (III),  Lindley’s  approach  to 
defining  admissibility  of  uncertainty  measures  is  given  together 
with  a  number  of  responses  to  his  claims  that  only  probability 
measures,  up  to  suitable  transforms,  are  admissible.  Finally,  10.2 
(B)  deals  with  the  various  research  problems  and  open  questions  that 
have  occurred  throughout  the  text.  At  the  back  of  section  10  Is 
added  an  appendix  on  the  modeling  and  semantic  evaluation  of  natural 
language  information. 
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10.2  Unresolved  Issues 


A.  Criticisms,  controversies  Involved  In  generalized  set  theory  anH 
probability  theory. 

(I) .  General  comments 


Despite  apparent  success  in  applying  fuzzy  set  theory  to  di¬ 
verse  disciplines  -  including  medical  and  fault  diagnoses,  pattern 
recognition,  clustering  techniques,  and  various  social/psychological 
areas  (see  again,  Dubois  and  Prade  [51]  for  a  full  survey)  as  well 
as  several  military  applications  (see,  e.g.,  Dockery  [49]  and  Watson 
et  al.,  [269]),  much  controversy  has  raged  between  those  advocating 
use  of  fuzzy  set  theory,  probability  theory  and  other  approaches  to 
the  modeling  of  uncertainties.  See  Stallings  [247],  [248]  vs.  Jain 
[122];  Tribus  [253],  [254]  vs.  Kandel  [129]  and  Zadeh  [284];  Zadeh's 
criticism  of  the  Dempster-Shafer  theory  [282],  Johnson  and  Shore's 
claim  [126]  that  maximal  entropy  and  other  standard  statistical 
techniques  supersede  fuzzy  set  modeling.  In  addition,  concerning 
the  use  of  fuzzy  logic  and  approximate  reasoning,  Haack  [102],  Tong 
and  Efstathiou  [251],  Schefe  [224],  and  Watanabe  [267],  [268]  have 
all  been  quite  critical;  Fox  [60],  on  the  other  hand  supports  its 
use.  (See  also  chapter  7  for  several  proposed  theories  of  uncer¬ 
tainties  . ) 

Let  us  consider  some  of  these  criticisms  and  suggestions  of 
alternative  theories  in  some  more  detail. 

Haack ' s  criticism  is  basically  directed  to  Zadeh's  second  order 
fuzzy  logic  [280],  [289],  where  fuzzy  set  membership  functions,  in 
effect,  have  ranges  of  values  (i.e.,  "truth"  values)  which  them¬ 
selves  are  fuzzy  sets,  as  opposed  to  Zadeh's  first  order  logic, 
where  ordinary  numerical  values  or  more  generally  lattice-values  are 
employed  for  the  membership  functions.  (It  is  the  latter  approach 
we  take  throughout  most  of  this  text,  not  the  second  order  ap¬ 
proach.)  Haack's  main  points  of  contention  include:  lack  of 
closure,  lack  of  use  of  formal  inference  rules  and  no  soundness- 
completeness  properties.  But  the  latter  are  missing  in  most 
multiple-valued  logics  anyway  -  not  just  Zadeh’s  approach.  (See 
section  2.3.3  or  Rescher's  comments  on  general  multi-valued  logic 
systems  [214],  pp.  161-166.)  (By  soundness  of  a  logical  system,  we 
mean  that  any  predicate  (or  relation  or  formula)  which  is  deducible 
from  the  system's  axiom  set  by  use  of  Its  inference  mechanisms,  is 
also  truth- table-wise ,  or  tautologically,  verifiable.  By  complete¬ 
ness  of  a  logical  system  we  mean  that  any  predicate  which  is 
tautologically  true  or  verifiable  Is  also  deducible  from  the  axiom 
set  of  the  system.  )  Other  desirable  properties  of  logical  systems 
Include  compactness,  decidability,  and  consistency.  (See  section 
2.3.3  for  discussions  of  these  concepts  for  classical  logic,  where, 
e.g,,  for  first  order  predicate  calculus,  soundness,  completeness 
compactness,  and  consistency  all  hold.  Again  see  Rescher  for  the 
problem  in  multi-valued  logics  [214],  pp.  62-66  of  the  failure  in 
general  of  such  logics  to  have  all  possible  operators  reducible  to  a 
primitive  set  of  operators  -  as  is  the  case  in  classical  logic.) 
Again,  it  should  be  reiterated  that  Zadeh's  fuzzy  set  system  9  is 

relatable  directly  to  Lukasiewicz  K ^  —  logic.  See,  e.g.,  Giles  [70 

-  73]  where  also  a  subjective  probability  approach  (see  also  section 
8.5)  involving  betting  is  established  for  fuzzy  set  membership  and 
operator  modeling,  although  it  should  be  noted  that  Lukasiewicz 
Logic  does  not  handle  fuzzy  quantifications  such  as  "some",  "many", 
etc.  (See  e.g.  Zadeh  [283].)  Finally  we  note  Gottwald's  work  on 
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Lukasiewicz  Logic  [94J ,  [95]. 

However,  if  enough  mathematical  structure  is  present  in  the 
form  of  a  topos  (see  also  section  2.4),  then  by  use  of  an  appro¬ 
priate  formal  language  (see,  e.g.,  Johnstone  [127],  pp.  152  -  64,  or 
see  sections  2.2  and  2.3  here)  3  a  theory  based  on  a  variation 
of  intuitionism,  may  be  developed,  with  corresponding  axioms  and 


deduction  rules,  which  is  sound  and  compl_i.e.  In  particular,  the 
important  topos  Higg  (H)  which  includes  Zadeh's  system  ,  i.e., 

Fuz(H)  as  a  subcategory ,  has  such  a  propety.  (See  again  section 
2.4.  ) 

In  addition,  Haack  complains  of  undue  formalism  in  Zadeh's 
treatment  of  vagueness  and  artificial  precision  (in  the  form  of  the 
numerical  valued  membership  functions)  -  but  of  course  Haack  tacitly 
accepts  an  analagous  precision  for  probability  functions  or  distri¬ 
butions  in  probability  theory  -  which  we  have  seen  is  a  special  case 
of  general  (fuzzy)  set  theory.  She  also  indicates  confusion  by 
Zadeh  in  the  interpretation  of  linguistic  quantifiers.  Fox  offers 
reconciliation  of  classical  logiq  and  Zadeh's  second  order  logic  by 
suggesting  use  of  "truth-prominance"  priorities  (borrowed  from 
production  rule  techniques  in  artificial  intelligence  theory)  in 
place  of  fuzzy  truths. 

Tong  and  Efstathiou  argue  that  fuzzy  truth  values,  such  as  "it 
is  very  truth  that",  do  not  need  to  be  formalized  relative  to  the 
truth  of  sentences.  They  also  agree,  in  effect,  with  Haack 's  cri¬ 
ticism  of  lack  of  closure  properties  for  fuzzy  truths  with  respect 
to  fuzzy  set  operations.  In  addition,  they  claim  Zadeh's  linguistic 
approximation  procedure  results  in  cumbersome  forms  that  are  not 
very  useful . 

In  Figure  I,  section  2.1,  a  summary  of  knowledge  flow  from 
conception  to  evaluation  is  presented.  This  text  has  emphasized 
natural  language,  formal  language,  general  logical  systems  and 
knowledge-based  systems  (the  latter  mainly  in  Chapters  8  and  9)  in 
treating  combination  of  evidence  problems  arising  from  real-world 
situations  (see  especially  Chapter  9).  However,  relatively  little 
attention  was  paid  to  the  upper  left  side  box  in  Figure  I:  cog¬ 
nition.  Some  of  the  controversial  work  of  Cohen  [38]  and  Kahneman 
and  Tversky  [128]  has  been  cited,  as  indeed  was  the  linguistic- 
cognitive  compendium  of  Winograd  [274’].  See  also  Anderson  [4']  for 
a  general  psychological  approach  to  cognition.  Another  area,  that 
of  "prototype  theory",  bears  some  comments.  Originally,  [40']  pro¬ 
totype  theory,  as  based  on  Zadeh's  fuzzy  set  theory  (for  other 
approaches  see  the  listed  references  in  [198'])  was  typified  by  the 
definition  of  a  "concept"  or  perhaps  better,  "extension  of  a 
conceot":  A  concept  A  is  identified  as  the  four-tuple  a  * 


(X.d.xo,q>ft)  ,  where  X  is  some  ordinary  set  (universe  of  discourse), 
d:XxX-»R  is  a  metric,  xq  «  X  is  the  "prototype"  and 

^A  :  X  ■*  [°,1J  is  the  membership  function  of  A  ,  so  that  for  anv 
x.  y  e  X,  if  d(x,x0)  S  d(y,xQ)  ,  then  9^(x)  e.  <|>a(y)  . 

Thus,  in  etrect,  this  forces  fuzzy  (or  generalized)  subset  .. 
o  X  and  xq  e  X  to  be  such  that  9A  is  unimodal  and  normalized 


'I  .e. 


w 


1) 


xq  .  Psychological  research  in  this  area 

splte  or  the  rather  weak  mathematical  efforts. 
'  th,®  9ulf  between  psychological  and  mathematical  analysis  of 
lon.is  n°  more  apparent  than  in  the  above-referenced  paper 
[198  ].  where  Mssrs.  Osherson  and  Smith  purport  to  show  Zadeh’s 
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fuzzy  set  theory  as  employed  In  prototype  theory  leads  to  contra¬ 
dictions.  but  classical  set  theory  doesn't.  As  one  example  of  this, 
consider  their  "incompatibility  with  concept  combination"  via  con¬ 
junction:  They  believe  that  if  X  =  all  fruit,  =  all  apples, 

A2  *  all  striped  fruit  and  Aj  B  A^  fi  A^  ,  xQ  or  an  x  close  to 

xQ  through  d  (as  illustrated  in  their  paper)  is  the  prototypical 

striped  apple,  then  in  effect 

•w ♦a1(x0)*  *a2(x0,)  =  \  (V  >  \  <x0>  ' 

violating  the  obvious  condition  <}>  (x)  £  (x)  ,  all  x  «  X  ,  by 

A3  A1 

supposed  intuition1.  ! 

Similar  gross  misunderstandings  of  basic  mathematical  relations 
take  place  in  their  second  "incompatibility  with  truth  conditions  of 
inclusion",  given  by  the  interpretation  of  the  sentence  "All  grizzly 
bears  are  inhabitants  of  North  America"  as: 

For  all  x  «  X  ,  ($A(x)  *  <f>B(x))  , 

where  X  is  the  class  of  all  animals,  A  is  the  fuzzy  subset  of' 
all  bears  and  B  that  of  all  inhabitants  of  North  America.  Obvi¬ 
ously,  this  interpretation  is  too  strong  when  they  obtain  the 
"fallacy"  for  x  =  Sam  ,  a  squirrel  living  on  Mars, 

<|>A{Sam)  >  <f>B(Sam)  ! 

As  a  quantification,  this  could  be  treated  as 

i>&  <VVX).VX,))  ' 

x«X 

or  through  Zadeh's  fuzzy  cardinality  approach 

W  *  V*A(x>'*B<x>>'  f*B<x>>  ' 

X* A  X€ X 

or  even  better  through  the  general  conditioning  approach  outlined  in 
2.3.8  (C)  and  (F) . 

On  the  other  hand,  the  recent  paper  of  Cohen  and  Murphy  [40’) 
extends  the  idea  of  "concept"  in  natural  language  to  a  "knowledge 
representation  model",  in  the  sense  of  AI  knowledge  representation 
through  a  schematic  display  of  the  various  subordinate  roles  a  con¬ 
cept  possesses.  In  turn,  this  leads  to  a  reasonable  approach  to 
quantifications  and  combinations  of  concepts  (in  their  sense).  ^How¬ 
ever,  they  accept  the  fallacies  of  [198* ]  and  indeed  attempt  naively 
to  avoid,  as  much  as  possible,  both  classical  set  theory  and  Zadeh's 
set  theory  in  a  formal  sense.  Oden  and  Lopes  [196"],  independently 
using  fuzzy  set  theory  (and  fuzzy  logic),  developed  a  more  detailed 
and  rigorous  approach  to  modeling  concepts,  but  still  retained 
psychological  aspects  of  cognition.  Their  approach-in  contradis¬ 
tinction  to  the  "similarity-to-ideal"  or  "to-instances"  approaches, 
compatible  with  [198']  -  apply  Zadeh's  original  ideas  of  fuzzy  pro¬ 
positions,  as  espoused  orlginaly  in  [290']  and  continued,  e.g  ,  in 
[286].  See  also  Niedenthal  and  Cantor  [194'J  for  further  discus- 
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sions.  See  Kempton  [136*]  for  interview  techniques  for  modeling 
category  prototypes. 

(II)  Watanabe's  criticise 


Watanabes  criticism  and  proposed  modification  of  fuzzy  set 
theory  (267]  also  bear  some  additional  analysis.  (See  also  section 
7.3. ) 

On  the  positive  side,  he  establishes  two  interesting  related 
mappings  connecting  fuzzy  sets  and  their  operations  with  ordinary 
sets  and  their  operations  in  a  nonprobabilistic  setting: 

For  any  (ordinary)  set  X  ,  define 

Q(X)  ^  (A  |  A  «  *(X)  &  0  S  <  1/2  ,  for  all  x  «  X}  , 

where  as  usual  ff(X)  is  the  class  of  all  fuzzy  subsets  of  X  while 
9 (X)  £  ?{X)  Is  the  class  of  all  ordinary  subsets  of  X  .  Then  de¬ 
fine  the  mapping  p  :  »(X)  x  a(X)  -♦  »(X)  ,  where  for  any  C  e  *(X) 

and  B  «  a(X)  ,  p(C,B)  «  ?(X)  ,  where  for  all  x  e  X 

d  Vx) 

^p(C,B)^X^  ~  VX)  +  ’ 

Then, 

(i)  p  is  bijective  (1-1  ,  onto)  with  Inverse  p-1  given  by,  for 

any  A  e  ?(X)  ,  p-1(A)  3  (a(A)  ,  w(A))  ,  where  A  =  p(e( A)  ,  «(A)) 

as  above  and  for  the  given  A  ,  the  core  8(A)  e  *(X)  and  frill 

u(A)  «  a(X)  are  given  for  all  x  «  X  : 

x  «  e(A)  iff  1  z  <j>A(x)  2  1/2 

X  «  8(A)  Iff  1/2  >  <)>A(X)  2  0  , 

W)(X)  =  1  VX)  "  ^8(A)(X)  1  * 

The  core  map  8  :  *(X)  -♦  9  (X)  is  of  course  surjective,  but  infi¬ 

nitely  many-to-1. 

(ii)  Let  k2.  k3  :  5(X)  x  S(X)  -*  O(X) 

and  kj  :  *(X)  -♦  Q(X)  ,  all  be  arbitary  but  fixed. 

Then  define  the  fuzzy  set  operations  for  complement,  inter¬ 
section  and  union  as  follows 

Ck  :  *(X)  -  *(X)  , 

1 

where  for  any  A  «  *{X)  , 

Ck  (A)  3  p( C (9 (A) )  ,  k  ( A) ) 

1  1 
,  U  :  ?(X)  x  S(X)  -  »(X)  , 

2  3 
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where  for  any  A,B  «  ?(X)  , 

A  n.  B  -  p (© ( A)  n  8(B)  ,  k_ ( A , B ) ) 

2 

A  U.B  ^  p  { 8 ( A )  U  8(B)  ,  k,(A,B))  . 
*3 


Then  it  follows  immediately  that  for  all  A  ,  B  «  *(X) 

8(A)  =  0  iff  sup  8  (X)  <  1/2  ; 
x«X 

9(C,_  (A))  =  C ( 8 ( A) >  ; 

1 

8 (A  D  B)  =8 (A)  n  8 (B) 

K2 

8 (A  U  B)  =8 (A)  U  8  (B)  , 

3 


and  hence  e  is  also  a  homomorphism  w.r.t. 
(C,n,u)  ,  ate. 


) 


and 


(iii)  We  can  extend  Watanabe's  results  further.  Consider  first  any 
binary  fuzzy  set  operation  ®  :  s(X)  x  s(X)  -♦  ?(X)  which  extends 
some  ordinary  set  operation  *  ,  i.e., 

(®  |  *(X)  x  9  (X))  S  »  , 


wherfe 


*  :  »(X)  x  »(X)  -  »{X) 


may  be  arbitrary  {but  fixed) .  Define  function  t  by 


*  (u) 


iff 

iff 


0  s  u  <  1/2 
1/2  5  u  S  1  . 


Suppose  also  ®  is  in  composition  form  i.e.  there  is  a 

g  :  [0,1]2  -»  [0,1]  with  g(0,l)2  -»  {0,1)  such  that  for  all 
A , B  «  5 ( X )  ,  and  x  «  X , 


^a  ®  b(x)  =  g<|t,A(x)  ■  <PB(x))  • 


Then  it  follows  readily  that  8  is  a  homomorphism  with  respect 
to  ®  defined  through  g  iff  for  all  u,v  e  [0,1]  , 

g(*u,  *v)  =  *g(u,v) 

iff  for  all  u,v  e  [0,1]  , 

g(u,v)  =  g{ t u ,  *v)  +  (-l)g(,u'  *v)-h(u,v)  . 
where  0  5  h  (u,v)  <  1/2  is  arbitrary  . 

In  particular,  for  *  *  n  ,  g  must  satisfy  the  boundary  con- 


Summary  of  Main  Results  and  Open  Questions 


557 


dltlon 


g(0.0)  -  g(0, l )  =  g{ 1 , o )  =  o  ;  g(l,l)  =  1  . 

Thus  for  9  to  be  a  homomorphism  for  (n)  defined  through  g  and 
for  n  ,  we  must  have: 

1  i.  g  (u,v)  £  1/2  for  all  1  k  u,v  k  1/2  ,  0  £  g(u,v)  <  1/2  for 
all  0  £  u,v  £  1  ,  otherwise. 

Hence,  g  =  min  as  shown  in  the  paper  will  do,  but  g  =  prod 
will  not,  since  for  example  prod{ 1/2 , 1/2 )  =  1/4  <  1/2  . 

Similarly  for  *  ■  u  ,  g  must  satisfy  g(0,0)  =  0  ,  g(0,l)  = 
g(l,0)  =  g(l,l)  =  1  ,  implying  for  «  to  be  a  homomorphism  for  (u) 
and  u  ,  we  must  have  for  (u)  through  g  :  0  £  g(u,v)  <  1/2  for  all 
0  £  u,v  <  1/2  ;  1  k  g(u,v)  k  1/2  for  all  0  £  u,v  £  1  ,  otherwise. 
Hence  g  =  max  will  work  but  g  ■  probsum  will  not  since,  for 
example,  probsum(3/8,  3/8)  =  39/64  >  1/2  . 

Next  if  ®  :  *(X)  -#  9  (X)  is  a  unary  operator,  extending 

*  :  *(X)  -*  *(X)  which  corresponds  to  (g):  [0,1]  -»  [0,1]  ,  with 

g  :  (0,1)  -♦  (0,1)  ,  then  for  all  A  «  9  (X)  ,  x  «  X 

4>  (x)  -  g(1>A(x))  . 

©  A 

Then  as  above,  e  is  a  homomorphism  w.r.t.  ©  and  *  ,  iff, 
for  all  u  «  [0,1]  , 

g(tu)  *  *g(u) 

iff 

g(u)  =  g(tu)  +  (-l)9^tulh(u)  ,  0  <  h(u)  <  1/2  arbitrary. 

Hence  for  *  *  C  ,  g  must  satisfy  g(0)  =  l  ,  g ( 1 )  =0  , 

whence  for  9  to  be  a  homomorphism  for  ©)  and  c  ,  we  must  have  for 
©  ,  through  g  : 

0  £  g(u)  <  1/2  ,  for  1  k  u  k  1/2 

1/2  £  g(u)  £  1  ,  for  1/2  >  u  2:  0  , 

yelding  many  choices  for  ®  including  that  for  g(u)  <*  1  -  u  . 

On  the  critical  side,  Wantanabe  joins  Schefe  (see  section  7.5) 
in  the  desire  to  preserve  all  of  the  basic  laws  of  Classical  Logic, 
especially  the  Law  of  the  Excluded  Middle,  and  criticizes  the  fuzzy 
set  membership  concept  as  involving  over-precif ications  and  adds 
(with  Manes  [170])  the  lack  of  empirical  ways  of  verifying  fuzzi¬ 
ness.  (See  also  Chapter  7  for  various  theories  of  uncertainty.) 

Arbib  [6]  has  raised  the  issue  of  the  lack  of  well-organized 
texts  in  fuzzy  set  theory,  while  Manes  [168],  [170],  questions  (1) 

the  validity  of  fuzzy  set  theory  as  a  legitimate  branch  of  mathe¬ 
matics,  by  pointing  up  the  lack  of  analogues  with  the  Central  Limit 

Theorem  and  Law  of  Large  Numbers  of  probability  theory  as  well 

as  the  apparent  arbitrariness  and  (2)  inconsistencies  occurring 
in  "fuzzifying"  or  extending  ordinary  set  concepts  such  as  the 
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"extension  principle"  -  the  fuzzy  set  extensions  of  ordinary  func¬ 
tional  transforms  operating  on  ordinary  set  or  points.  (See  also 
section  7.2.) 


(IXI)  Admissibility  of  uncertainty  measures  r91'1. 

Recall  that  an  uncertainty  measure  can  be  considered  as  a  map¬ 
ping  r  :  A  -»  I  ,  where  (X,.*)  is  a  measurable  space,  i.e., 

x  £  »(X)  is  a  o-algebra  and  IQ  £  R  is  a  closed  interval.  A 

number  of  relations  were  developed  between  various  classes  of  uncer¬ 
tainty  measures  in  Chapters  3,  4  and  5,  but  no  specific  comparisons 
were  established  showing  one  class  of  uncertainty  measures  was 
"better"  than  another.  Indeed  the  theme  has  been:  each  situation 
dictates  a  perhaps  different  appropriate  uncertainty  measure. 
However,  Lindley  [158]  generalizing  Savage's  earlier  work  [221*], 
pointed  out  that  by  using  additive  scores,  uncertainty  measures  can 
be  considered  as  decision  functions  relative  to  event  sequences. 
Consequently  it  is  meaningful  to  consider  the  class  of  Bayes  uncer¬ 
tainty  measures,  etc,,  among  the  many  items  one  can  investigate 
within  a  statistical  decision  theory  framework. 

Lindley  concluded  that  only  uncertainty  measures  that  are 
certain  transforms  of  prooability  measures  are  admissible  and  large 
classes  of  nonstandard  ones,  i.e.,  upper  and  lower  probabilities, 
confidence  statements,  and  possibilities,  are  all  inadmissible. 
Because  of  these  rather  strong  conclusions  and  the  informality  of 
Lindley's  argument,  a  more  detailed  decision-theoretic  presentation 
is  given  here  together  with  some  clarifications  and  modifications  of 
the  original  conclusions. 

(I)  Uncertainty  measures  can  be  considered  as  decision  functions 
relative  to  the  following  framework: 

Let  :  HQ  3  {0,1}  ,  Hx  ^  [0,1]  .  Let  (X,a) ,  (Y,a),  (Z,C)  be 

measurable  spaces,  i.e.,  x  £  *(X),  a  £  a(Y),  c  £  a(Z)  are 

o-algebras,  in(x)  *  {«  |  e  £  x  and  e  Is  at  most  countable)  , 

P  :  Y  -•  ?„,(•*)  ,  q  :  Z  -*  x*  arbitrary  known,  fixed  maps,  repre¬ 
senting  choi ce  maps. 

x  is  the  collection  of  basic  events  on  X  , 

0  -  X  x  1m{X)  =  parameter  space , 

^  (  (  (g(B)  )Bec  ,g)  I  C  e  .  g  «  xA]  =  cfata  space  , 

a  =  U  [a,b]C  (  a  <  b  fixed  real  constants)  =  decision  space, 
ee}w(A) 

8  X  x  Y  -*  n  is  the  parameter  map,  where  for  all 

n  «  v  , 


» (« ,n ! 


£ 


(<•>  .  P (7  )  )  , 


u  €  X  , 
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7 

V  :  n  -♦  is  the  data  map,  where  for  all  0  =  (u,C)  €  n  , 


all  i  «  Z  , 

V  ( 8  )  ( •* )  =  (  ( q  ( "* )  (B)g(  .q(-O)  . 

g  C  9^  is  the  decision  function  space,  so  that  any  r  €  g  , 
and  any  y  =  ng(B)a€C,g)  «  U  .  e  «  »  3  «  **  » 

*"  ( y )  =  (rB(9(B)  )  )Bee  ,  where  for  all  Bee, 

r  ( g(B) )  =  uncertainty  of  event  B  occurring,  given  g(B)  is 
0 

observed  to  occur. 

Each  such  r  ;  ^  -»  a  is  a  conditional  uncertainty  measure  map, 
and  we  use  the  notation,  for  any  g  e  ax  ,  (r|g)  :  a  ,  where 

for  any  e  «  J0o(-A)  » 

(r|g)(e)  ^  (rB(g(B)))B€e- 

Also  define  the  score  function  class 
(0)  sc  =  { f | f  :  [a,b]  x  HQ  -  R+  satisfies  regularity  condition 


f  is  continuously  differentiable  in  its  firs'*-  argument 
with  f(x,t)  nonincreasing  in  x  ,  for  all  ten 
x  s  xt  f  .  for  some  x^  f  ,  for  some  f  >  0 

<5f(Xt  f,t)/<5xt  f  =  0  , 

and  f(x,t)  nondecreasing  in  x  ,  for  all  x  i  x  .  , 

x  j  r 

and  +  ®  >  M.  =  max  sup  f(x,t)  ; 

t€HQ  xe  [  a  ,  b  ] 

3  <  x0,f  <  xl,f  <  b  ’ 

Define  for  each  f  e  X  ,  loss  function 


:  9  x  Q  -*  r 


where,  for  any  «•  e  and  h  e  [a,b]  ,  and  any 

0  =  (u,e")  «  n  ,  we  have  the  additive  form 


d  ,  +®  ,  if  e •  *  t  " 

L  (h,8)  =  z  f(h(B)  ,<}>  («) )  ,  iff  e •  =  e "  , 

r  •  B 


where  <$>:*-»  {0,1}  is  the  ordinary  membership  function  map,  i.e., 
for  any  B  «  x 
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Vx) 

-  fl 

iff 

X  € 

B 

lo 

iff 

X  < 

B  . 

Assume  all 

maps 

9  , 

v,  r  , 

L.  are  measurable  in  the  usual  sense 

for  the  appropriate  measurable  spaces. 

In  turn,  the  usual  decision  theory  definitions  apply  here 
including  conditional  risks,  posterior  risks  and  Bayes  decision 
functions  with  the  above  interpretations. 

Define  also  for  each  f  e  X  ,  the  class  of  all  ij-ada/ssib/e 
uncertainty  measures 

•y-admissf  =  {r  |  r  «  <3  and  for  each  y  3  (g(B))B€<;  €  ^  ' 
there  is  no  r*  e  g  such  that  eq.{2)  holds) 

(2)  Lf  (r  ■  ( y),e)  i  Lf(T(y),9)  ,  all  e  «  n  with  strict  inequality 

holding  for  at  least  some  sen. 

Define  also  the  class  of  all  locally  admissible  uncertainty 
measures  as 

loc-admisf  ^  (r  |  r  «  5  and  there  is  no 

y  =  (  (g(B)  )Be(.  ,g)  «  •))  such  that  eq.(3)  holds) 

(3)  (total  differential)  (d  L-(u,8))  0  ;  all  Ben, 

with  strict  inequality  holding  for  at  least  some  sen. 

Equivalently, 


(  3  •  ) 


B«e 


<5  f  (  X  ,  <|>g  ( w  )  ) 


<?x 


s  0 


JX=rB(g(B) ) 


for  all  9  =  (u,e)  e  n  ,  with  strict  inequality  holding  for  at  least 
some  8  . 


Similarly,  for  each  y  a  Y  ,  define 

loc-admiss,  .  =  (r  |  r  e  g  and  eq.(3)  does  not  hold). 
r  >  Y 

Next,  define  the  H  ndl ey-tranat  orm 

:  x  -*  *  (surjective) 

where 

*  -  { Q !  Q  :  [a’,b‘]  -♦  [0,1]  is  continous,  nondecreasing 
and  such  that  Q(a’)  =0  ,  Q(b')  =  1)  ; 


a  s  a '  <  b'  s  b 
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and  for  any  f  «  X  ,  and  all  xQ  {  £  x  £  x^^  ^  ,  define 

d  (<5f  (x,0)/«x) 

(4)  Pf  =  <5f(x,0)/ax)  -  (<5f  (x,  1 ) /<?x)  ’ 

In  turn,  define  for  each  f  «  X  ,  Bindley 's  class  of  uncertain¬ 
ty  measures  as 

Lindj  3  {r  |  r  «  q  and  there  is  a  probability  measure  v 

malting  (X,-*,w)  a  probability  space  such 
that  eq.{5)  holds} 

(5)  Pf(rB(g(B)))  =  v  (B  |  g(B)) 

*  »  ( Bflg  ( B ) )/u(g(B) )  , 

for  i/  (g(B)  )  >  0  ,  for  all  B  e  j.  ,  g  e  aa  ,  i.e., 

Pf{Tg(F))  =  u(B  |  F)  ;  for  all  B.F  «  A  . 

(II)  It  follows  that  for  all  f  e  X 


(6)  "y-admiss.  £  loc-admissf  =  n  loc-admiss . 

y«U  ,y 


£ 

E3 

Z1  n  z2  n  Z3 

Lindf  , 

where 

(7) 

■Si  4( 

iY  1 
• 

Y  = 

9(8) 

i  , 

c, 

0, 

^  b  *  l  •  a11  c'  F  €  *1- 

(8) 

3  2 

fy 

1  Y 

=  (g(B))B€e  €  T  ; 

9(B) 

i.\ 

c, 

0, 

if  B  *  F,  C  F  all  c  p  .  ,1 
if  B  *  F,  C  F  *  aI1  C’  F  XJ 

(9) 

<y  S 

*3 

[y 

1  y 

=  (9(B))B6C  €  ^  ; 

9(B) 

|f  n 

G  if  B  =F,  F  n  H 

G,  if  B  =  H  all  B,  F, 

0,  if  B  *  F,  F  n  H , H 

(10)  z  =  n  loc-admiss. 

Y^°i1  'V 

»  (r  |  r  «  ?  and  for  all  F,  C  6  a  , 

if  C  *  0  ,  rc( C)  =  X1#f  ,  rcc(C)  =  XQ ^ f 
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rp(C)  «  [xQ(f  ,  xltf]}  , 


(11)  z2  =  n  loc-admissf  =  {r  |  r  €  <3  and  for  all  F,  C  «  a  , 


ys'S 


Pf(»-F(C))  +  P#(rnc,(C))  =  1}  , 


f '  CF' 


(12)  z  =  n  loc-admiss  =  {r  |  r  «  5  and  for  all  F,  G,  H  «  a  , 
ye?  r,y 

pf(rFnH(G)>  =  WG)>  *  Pf(^H(Fnc))}  . 


(Ill)  Now  let 


(13)  *jj  ^  (f  |  f  e  M  Is  such  that  in  eg.  (1)  "nonincreasing"  and 

"nondtwreasing"  are  replaced  by  “strictly  decreasing" 
and  "strictly  increasing",  respectively). 

Then  it  follows  that  for  all  f  e  XQ 

Pf  s  [xQ  j/Xj  -*  [0,1]  is  continuous  and  "strictly  increas¬ 
ing",  and  for  any  t  «  , 

inf (f (x, 1 )  •  t  +  f (x, 0)  •  ( 1-t )) 

X€[x0,f ,xl,f] 

occurs  uniquely  at  x  «  xt  f  ,  where  by  differentiating  w.r.f.  x  . 

(14)  pf(xt>f)  -  t  , 


i  .e.  , 


(14’)  xt  f  >=  Pf  X(t)  ,  for  all  t  «  Hj  . 

(IV)  Now  for  any  f  «  H  ,  any  probability  measure  /u  such  that 
(XxYxZ,o{ax»xC),^}  is  a  probability  space  such  that 
marginal  probability  measures  and  £fyxZ  are  independent,  for 

any  r  «  g  and  for  any  y  3  (yB)Bee  ^  (rB(g(B)  )  ,  g)B€t  the  pos¬ 
terior  risk  is 


(15)  rf(p.r,y)  £ 

E(Lf(r(V(e(- ,..))(••• )),  »(-,••))  I  V(e (•,••))(••• )  =  y) 

=J,Kf(YB>  *  #,YxZ(p"1<*)  *  ’ 

B€  C 


where 

(16)  Kf(yB) 

=  E  (  f  ( r  B  (g  (B )  )  ,  <J»B  (  •  )  )  I  ( q (  •  •  M  B )  .  q(  .  .  )  )  =  y  ) 
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-  f(rB(g(B)),l)./ix(g(B)  n  B)  +  f  { r  g  ( g  ( B )  )  ,  0 )  •  px(g(B)  H  B) 

-  wx(g(B))-  (f(TB(g(B).l)-/ix(B|g(B))+f(rB(g(B))  ,0)* 

•  <l-px(B|g(B))))  . 

Then  applying  the  results  in  (III)  to  eqs.  (15)  and  (16), 
j'nf  r-(^/,r,Y)  occurs  uniquely  for  the  Bayes  uncertainty  measures 
re  G  1 

r  =>  g  ,  where 

(17)  Pf(r^'f)(g(B)))  =px(B|g(B))  . 
for  all  B  e  A.  ,  g  e  aX  . 

Conversely,  if  equation  (17)  holds  for  some  f  then 

r(fj,f)  is  a  (unique)  Bayes  uncertainty  measure  for  the  posterior 
risk.  Since  all  uniquely  determined  Bayes  decision  functions  are 
^-admissible,  combining  these  results  with  eq.(6)  yields 

(18)  Bayes ^  =  ‘y-admis^  *  loc-admiss^  =  Lindf  , 

for  all  f  e  JC0  ,  with  the  form  of  the  Bayes  decision  functions 
given  above  in  eq.  (17). 


Remark  1 . 

Lindley  shows  that  a  number  of  well-known  classes  of  uncertain¬ 
ty  measures-including  one-sided  confidence  intervals,  significance 
statements,  Zadeh’s  possibility  measures  and  upper  and  lower  pro¬ 
bability  measures  -  are  not  ‘y-admissible.  This  is  accomplished, 
noting  eq.  (6),  for  any  such  candidate  uncertainty  measure  r  ,  that 
r  €  Lindj  ,  i . e . , 

Pf  (r.  (•  •  )>  *  vx(-  |.  •  )  ; 

for  all  probability  measures  vx  over  X  relative  to  a  ,  and  all 
f  e  3C  (or  f  «  ) .  This  is  equivalent  to  showing  either  (or  both) 
eqs.  (19)  and  (20)  do  not  hold: 

(19)  Pf(rp(C))  +  Pf(rCF(C))  -  1  ,  all  C,  F  €  a 
and/or 

(20)  P£(rPnH(G))  *  Ff(rF(G))  •  Pf(*H(FnG))  ,  for  all  F ,  G ,  He  A  . 

However,  Lindley  does  not  consider  'except  for  his  "proper" 
case  of  f  constrained  so  that  Pf(x)  s  x)  the  validity  of  tne 

class  of  all  re  G  such  that  r  {••)  itself  is  a  (conditional) 

probability  measure  family. 

Indeed,  the  answer  in  general  is  negative: 
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Theorem  1 


Under  the  same  basic  assumptions  as  in  part  (I),  where  now 

xo  *  0  '  xi  =  1  : 

If  r  e  g  is  such  that  r.  (••)  is  a  non-atomic  conditional 


probability  measure  family, 
measure  w  making 


i.e.  there  is  a  non-atomic  yrobabi 1 ity 
a  probability  space,  such  that 


(21) 

rp(C)  =  v ( F | C)  ,  all  C.  F  «  * 

I 

with 

v (C)  *  0  ,  then: 

(i)  Unless  r  is  constrained 

so  that 

(22) 

Pf (x)  *  x  ,  for  all  x  e  , 

such 

as  for  Savage's  choice  [221], 

(23) 

f(x,t)  =  (x-t)2  ,  all  x  € 

[a , b]  ,  r 

then 

r  is  not  ‘y-c.^.-iissi  ble . 

(ii)  When  t  is  chosen  so  that  eq.  (22)  holds,  then  r  is 
“^-admissible . 


Proof:  The  proof  involves  a  basic  technique  in  functional  equa¬ 

tions.  Since  r  (••)  is  non-atomic,  eqs.  (19)  and  (20)  become 

(24)  Pf(x)  +  Pf(l-x)  =  1  ,  all  x  €  [0,1] 

(25)  Pf(w-z)  =  Pf(w)  •  Pf(z)  ,  all  w,z  e  [0,1], 

Differentiating  eq.  (25)  first  w.r.t.  w  and  then  w.r.t.  z 
yields,  for  all  w,z  «  [0,1]  ,  the  primes  denoting  derivatives, 

z  •  P^(w-z)  =  P£(w)  •  Pf(z)  , 

w  •  P'(wz)  =  Pf(w)  •  P^(z)  , 

whence 

P£(w)  •  Pf(z)/z  =  P'.  (z).Pf(w)/w  . 


i.e. 


z  •  d  log  Pj(z)/dz  =  w  •  d  log  P^(w)/dw  , 


implying 


z  •  d  log  P^(z)/dz  =  const,  ;  all  z  «  [0,1]  , 


which  yieius  the  solution 
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pf(z)  =  z  ;  all  z  €  [0,1]  , 
for  any  fixed  k  >  0  . 

However,  use  of  eq.  (25)  immediately  implies  that  k  =  1 


Corollary  1 


Under  the  same  basic  assumptions:  If  r  is  “((-admissible  and 
is  such  that  it  can  continuously  assume  all  values  in  [0,1]  ,  then 
P(r  (•*))  is  an  atomic  probability  nonsure  family  and  hence  not 

^-admissible,  unless  eq.  (22)  holds.  Similarly,  in  general,  all 

iterates  P^(r  (*•))  *  for  j  =  1,2,...  ,  are  not  “((-admissible . 


Remark  2. 

The  above  result  may  be  weakened  as  follows;  If  r  is  com¬ 
plementary,  i.e.  satisfies  eq.  (19)  and  continuously  assumes  all 
values  in  ,  then  if  r  is  “((-admissible  ,  eq.  (24)  must  hold, 

i.e., 

(26)  f ' ( x , 1 )  •  f ' ( 1 —x , 1 )  =  f“(x,0)  •  f • ( 1-x, 0) 
for  all  x  «  [0,1]. 

Thus ,  e . g . , 

(27)  f(x.t)  =  t- |x-t|kl  +  (l-t)xk2  , 

for  all  x  €  [0,1],  with  >  1  '  kl  *  k2  constants,  will  not 

satisfy  (26),  and  consequently  r  connot  be  “((-admissible  for  that 
choice  of  f  . 


Remark  3. 

All  uncertainty  measures  can  be  extended-in  the  sense  to  be 
described  below  -  to  probability  measures  over  higher  order  spaces, 
which  in  turn  are  all  “((-admissible  relative  to  f  chosen  as  in  eq. 
(23)  ; 

Let (ft, a, Pr)  be  a  fixed  non-atomic  probability  space  and 
U  :  a  -*  Hj  ,  a  uniformly  distributed  random  variable.  Let  (X,*) 

be  a  measurable  space,  i.e.,  A  S  »(X)  ,  Is  a  o-algebra  on  X  ,  and 

in  turn,  (a,B)  be  a  measurable  space. 

For  each  A  e  a  ,  define  the  filter  class 

(26)  cA  S 


and  let 


(e  |  A  €  c  e  a) 
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(29)  c  &  o{cA  !  A  €  a)  s  »(;)  , 
forming  the  measurable  space  (a,  e)  . 

Define  also  the  class  of  uncertainty  measures  on  a.  ■. 

(30)  ■*(-*)  3  (r  |  r  :  ji  -+  H1  is  a  random  variable,  i.e., 

(a,  B^)  -  measurable) 

and  the  class  of  all  probability  measures  on  c  : 

(31)  9>r(c)  -  (u  I  (®>  c  ,  v )  is  a  probability  space)  . 


Consider  also  the  following  mappings: 


(32) 

S  : 

■*(■*) 

-# 

an 

where 

for 

all 

r 

€  M(A) 

t 

S( 

*)  : 

a  -*  a 

where 

for  all 

(33) 

Sir; 

)  («) 

g 

r_1[U(u 

») 

.1] 

— 

(B  |  B 

€ 

A 

and 

r(  B) 

e  [U (w ) 

,  1]) 

= 

(B  |  B 

e 

A 

and 

U(w) 

*  MB)> 

= 

(B  |  B 

c 

A 

and 

W  € 

o 

rH 

1 

D 

r  ( B )  ]  }  , 

(34) 

h  : 

*  (A) 

-♦ 

»r(C) 

i 

s  injective. 

where 

for 

all 

T 

€  A  (A) 

# 

(35)  h(r )  ^  Pro (s (t) ) 


noting  that  for  all  A  e  a  , 


(36)  (S(r)  1{CA) 


(u  |  u  €  a  and  A  €  S(r )(w)} 
(u  |  u  «  tl  and  U(w)  S  r(A)) 
U-1[0,r(A)]  €  a  , 


(a,  c,  >r«(S (r))  *)  is  the  induced  probability  space  under  S 
and  hence 


Pro (S(r ) ) -1  «  *r(c)  . 

The  above  result  directly  Implies 

(37)  h(r)  (CA)  =  ?rou-1[0,r (A) ] 

=  r (A)  ;  all  A  e  A  . 


is 


(r)  , 
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Also  define 

(38)  k  :  »r(C)  -»  a(a)  . 

surjective,  where  for  all  v  e  9t(c)  ,  k(w)  S  v  (c  ) 

Note  the  surjectivity  of  k  holds  since:  For  any 

t  «  A(A)  ,  h(  r )  e  9r  (£)  , 

and 

(40)  k(h(r))  =  h( r ) (C  _  )  *  r(.  ) 

Note  also  that  for  any  element  nee,  writing  it  symbolically 
as 

(41)  n  *  comb(C.n,U)  (••<=£-•)  » 

and  any  r  e  41(A)  ,  extending  eq.  (36)  , 

(42)  (S(r))-1  (»)  =  comb(C.n,U)(--(S(r),)1(CA)..  ) 

=  comb(C ,n,U) (• .U_1[0,r (A) ]•  •  ) 

=  0_1(comb(C ,n,U) (• • [0,r(A)]-- ))  , 

because  inverse  functions  preserve  all  ordinary  set  operations,  and 

which  now  can  be  conveniently  computed.  In  tuin  Pr®(S(r))  1(n) 

can  be  obtained  as  Pr<>U-1  ( comb  (C  ,U  ,0  )  ( •  •  [  0 ,  r  ( A)  ]  •  *  )  )  from  eq.  (42). 
Thus  h(r)  can  be  obtained  essentially  in  a  natural  constructive 
sense  over  c  . 

M 

(i)  In  summary t no ting  the  mapping 

(41)  C,  :  A  -*  (cA  |  A  <  a  )  s  c  ,  is  bijective  and  hence  injective 

in  £  ;  h  in  eq.  (34)  is  injective;  eqs .  (37)  and  (42)  all  imply 

that  in  a  natural  way  •*(■*)  is  imbedded  within  Pr®(c)  .  (Indeed, 

this  development  is  essentially  equivalent  to  the  representation  of 
dispersions  through  one  point  coverage  functions  of  nested  random 
sets  as  developed  in  Chapters  4-6.) 

(ii)  Also,  referring  to  the  notation  in  part  (I),  replacing  a 
there  by  £  ,  etc.,  then  for  all  f  such  as  in  eq.  (23),  so  that 

eq.  (22)  holds,  it  follows  from  Theorem  1  that  all  probability 
measures  v  «  er(£)  are  D-admissible^  (making  the  obvious  iden¬ 
tifications  as  need,  c.g.  in  eq.  (21)). 

Hence,  results  (i)  and  (ii)  imply  in  the  above  sense  that  all 

r  e  A(a)  are  admissible ^  ! 
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B.  Research  issues 


The  development  of  generalized  set  theory  -  as  opposed  to 
Zadeh's  fuzzy  set  theory  and  classical  probability  -  is  truly  a 
brand  new  area,  The  applications  to  natural  language  modeling  may 
enable  feasible  use  in  a  systematic  manner  for  information  that  was 
previously  ignored  to  be  fully  integrated  with  statistical  descrip¬ 
tions  of  unknown  parameters,  including  data  association,  search 
parameters,  and  fault  diagnosis  levels.  The  basic  combination  of 
evidence  algorithms  presented  in  Chapters  8  and  9  in  their  present 
form  have  the  capability  of  treating  both  literal  and  numerical 
information.  This  should  be  tested  against  simulated  real-world 
situations.  All  of  these  developments  rest  on  the  theoretical  bases 
established  previously.  The  research  issues  to  be  presented  conse¬ 
quently,  must  have  an  impact  on  the  design  of  these  algorithms. 
Choice  of  the  most  appropriate  general  logical  system  is  another 
very  important  factor,  as  is  resolution  of  random  set  representation 
ambiguity  for  modeling  input  information.  Modeling  of  natural 
language  forms  is  of  course  also  critical.  Thus,  all  of  the 
research  issues  really  drive  the  design  of  the  algorithms  in  a  true 
symbiotic  relationship. 

In  this  section,  we  pose  a  number  of  basic  questions  (with  not 
enough  answers  at  present!)  for  issues  that  arise  concerning  the 
role  of  natural  language  information,  semantic  evaluation  of  formal 
language,  general  .logical  systems,  and  other  concepts  in  the  model¬ 
ing  of  uncertainty.  Unfortunately,  a  number  of  basic  problems 
remain  concerning  the  foundation  and  implementation  of  the  combi¬ 
nation  of  evidence  problem,  discussed  in  Chapters  8  and  9.  Even 
more  basic  than  this  are  open  problems  involving  the  representation 
of  natural  language  Information,  semantic  evaluations,  and  a  number 
of  other  important  issues. 


Issue  #1  Combination  of  evidence  (Chapters  8  and  9) . 

Primary  on  the  list  Is  that  of  the  problem  of  combining  evi¬ 
dence.  Although  some  results  have  been  obtained  In  the  direction  of 
generalizing  the  classical  probability  form  of  the  Weak  Law  of  Large 
Numbers  to  generalized  set  theory  (section  8.4),  more  satisfactory 
upper  bounds  on  convergence  or  firming-up  rates  must  be  obtained. 
The  new  idea  of  extending  a  t-conorm  to  an  uncountable  number  of 
arguments,  analagous  to  the  role  integration  plays  with  averaged 
summations,  should  prove  useful-as  developed  in  section  8.4.  In  a 
related  manner,  suppose  there  are  n  probability  or  fuzzy  set 
descriptions  -  noting  that  in  the  context  of  generalized  set  theory, 
unlike  Zadeh's  original  development,  all  probability  functions  and 
their  operators  can  be  shown  to  be  special  cases  of  the  general 
system: 


♦A . <K  :  *  -  [0,1] 

1  n 

for  common  unknown  parameters  9  €  X  ,  which  also  may  contain  some 
nuisance  subparameters.  In  the  basic  combination  of  evidence  algo¬ 
rithm,  (see  Chapters  8  and  9)  9  may  be  identified  with  the  pair 

( Z , Q )  ,  where  Z  ,  the  dummy  data  parameter  may  be  later  projected 

out  and  where  Q  represents  the  prior  data  association  level 
between  two  given  track  histories.  Clearly,  for  any  choice  of 

confidence  levels  a,,..., a  ,  the  region 

in 
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Thus  in  an  inclusion  set-sense  ,  Rn(Ai ,A  ;<x <an)  is  decreasin9 

in  n  ,  for  n  =  1,  2,  .  However,  this  decrease  is  not  necessarily 

to  a  single  point  {8}  as  would  be  desirable  and  occur  under  rea¬ 
sonable  asymptotic  conditions  within  a  pure  probabilistic  setting  - 
i.e.  classical  consistency  would  hold.  Indeed,  the  above  sets  are, 
in  general,  not  compact,  so  the  convergence  is  not  necessarily  to  a 
simple  set.  What  is  sought  is  a  single  generalized  set  A(n)  °f  x 

and  some  combining  confidence  level  function  gn  , 

9n  :  [0,l]n  -  [0,1]  for  each  n  ,  such  that 


Rn(Al - An ' a 1 ' 


([V°V 


-  an } , 1 ] ) 


and  the  expression  on  the  right-hand  side  of  the  above  equation 
decreases  in  a  set' inclusion-sense  for  all  n  sufficiently  large, 
although  not  necessarily  to  only  e  .  Also,  numerical  upper  bounds 
on  these  expressions  and  rates  of  decrease  are  desirable.  Previous 
results  were  obtained  only  when  gn  was  given  in  nondecreasing 

form CBO]. Here  we  wish  to  derive  g^  under  no  restrictons,  if  pos¬ 
sible.  (See  also  section  8.4  and  Chapter  9  ,F)  for  related  results 
concerning  the  asymptotic  behavior  of  combination  of  evidence  esti¬ 
mation  as  information  granularity  decreases.) 


That  the  combination  of  evidence  problem-even  for  classical 
statistical  modeling  -  in  expert  rule-based  systems  is  still  an  open 
issue,  can  be  affirmed  by  the  many  discussions  and  ad  hoc  results. 
Finally,  it  should  be  remarked  that  the  random  set  one  point  cov¬ 
erage  representations  mentioned  earlier  could  play  a  useful  role  In 
approaching  this  problem  by  transferring  known  properties  of  com¬ 
bining  random  sets  to  the  fuzzy  set  images. 
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Issue  *2 .  One  point  coverage  functions  of  randoa  sets 

and  equivalent  dispersions  (Chapters  5  and  6). 

Another  issue  of  importance  is  the  resolution  of  the  ambigu.'ty 
of  the  one-point  coverage  representation  of  dispersions,  or  equi¬ 
valently,  generalized  sets  by  random  sets  -  and  analogously,  of  the 
compound  set  evaluation  representation  of  generalized  sets  by  random 
variables.  Some  progress  has  been  made  on  this  problem  in  the  case 
of  finitely  discrete  spaces,  by  making  use  of  entropy  as  a  measure 
of  ordering  the  representative  random  sets.  (See  section  5.4.)  It 
is  known  that  the  minimal  entropy  set  must  lie  in  the  vertex  set  of 
a  space  which  is  bijective  to  the  class  of  all, representing  random 
subsets  to  the  given  fuzzy  set.  However,  further  characterizations 
are  not  known  at  present.  On  the  other  hand,  it  is  easily  shown 
that  the  maximal  entropy  solution  is  given  by  the  choice  of  the  ran¬ 
dom  set  T ( A)  ,  for  given  fuzzy  set  A  ,  where  the  ordinary  random 
membership  function  for  T(A)  corresponds  to  a  mutually  statis¬ 
tically  independent  zero-one  process,  with  one-values  corresponding 
to  the  values  of  the  dispersion  $A  .  (See  section  5.4.) 

The  extension  of  the  probl  m  to  a  more  general  setting  for  t’  a 
base  space  is  of  course  desirable.  This  also  must  entail  -  as  has 
been  carried  out  for  the  previous  special  case  -  characterizations 
of  the  entire  solution  class  of  random  set  representations  for  each 
fuzzy  set.  Other  measures  of  the  ordering  of  uncertainty  should 
also  be  considered,  such  as  generalized  variances,  volume  coverages, 
and  variations  of  classical  information  measures. 


Issue  #3 .  Randoa  variable  representations  of  dispersions 
(Chapter  6) . 

It  has  been  shown  (as  briefly  mentioned  above  in  #2)  that  not 
only  do  all  generalized  sets  have  random  set  representations,  but 
also  possess,  in  a  canonically  corresponding  way  to  each  such  random 
set  representation,  random  variable  representations,  where:  given  e 
base  space  X  and  fuzzy  subset  A  with  membership  function 
<f>A  :  X  -»  [0,1]  ,  a  random  set  representation  S  (in  general;  S  is 

not  unique,  since  for  example  Sy(A)  3  1  (  [  U ,  1  ]  )  or  T(A)  as 

described  briefly  in  item  #2,  may  be  chosen,  among  many  others 

[77]  )  ,  S  S  X  ,  holds  with 

<$>A(x)  =  Pr(x  t  X). 

In  addition,  a  space  of  elementary  events,  say,  Y  ,  exists  and  a 
random  variable  V  (in  general  not  unique,  but  determined  through 
S)  exists  over  Y  such  that  all  points  x  in  X  may  be  considered 

also  as  compound  events  from  Y  ,  i.e.,  subsets  of  Y  ,  such  that 

Pr(V  ex)  =  4>a(x)  ;  all  x  e  X  . 

Analogous  to  the  isomorphic-like  or  weak-homomorphis  relation¬ 
ships  developed  in  this  book  between  fuzzy  set  ODorations  and  random 
set  operations,  similar  relationships  are  sought  for  random  variable 
representations . 
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Issue  #4.  Multiple  point  coverage  representations 

The  problem  of  multiple  point  coverage  function  representation 
by  random  sets  also  remains  an  open  problem.  In  the  extreme  cases 
where  all  possible  subsets  and  incidences  foi  random  sets  are 
specified,  a  number  of  results  have  been  obtained  by  exploiting  the 
Choquet  capacity  tneorem  and  its  variations.  Also  tie-ins  with  the 
Dempster-Shaf er  theory  of  uncertainty  have  been  established  as  part 
of  this  work.  (See  e  pecially,  Chapters  4  and  5.)  In  addition,  the 
other  extreme  case  .rn  also  been  investigated  with  many  fruitful 
results:  namely,  the  one  point  coverage  case  described  in  the  above 

items.  However,  little  progress,  except  for  tne  work  ir.  discrete 
spaces  characterizing  all  random  set  representations,  has  been  car¬ 
ried  out  for  the  following  n-point  coverage  problem: 

Let  9  (X)  denote  the  class  of  all  ordinary  subsets  of  space 
n 

X  which  have  at  ■■-•t  n  elements  (distinct).  Let 
f  :  *n(X)  -  [0,1] 

be  a  given  function.  (When  n  =  1,  this  is  the  same  as  a  fuzzy  set 
membership  function.)  Then  conditions  axe  sought  for  f  and  a 
random  subset  S  of  X  is  sought^such  that 

f(C)  =  Pr(C  S  S)  ;  all  C  €  *n(X)  . 

This  problem  entails  many  more  difficulties  than  the  one  point 
coverage  problem,  since  in  general  it  can  be  shown  that  unless  f 
is  suitably  restricted,  no  solution  exists  for  the  above  equation 
for  S  . 


Issue  #5.  Extension  of  category  and  topos  theory  f onmlations  of 

generalized  set  theory  and  relations  with  general  logical 
systems  (section  2.4). 

Recall  the  extensive  discussion  in  section  2.4.2  (F)  con¬ 
cerning  the  relations  between  Goguen's  category,  Gog  ,  Eytan's 
original  category  Fuz  and  modified  category  (and  indeed  in  some 
cases,  topos)  Fu2„  -  and  Higg's  topos  Higg  . 

(1)  Is  Higg  the  smallest  topos  containing  Fuz  ,  whose  subobject 
classifier  is  H  ,  the  natural  complete  Heyting  algebra  general¬ 
ization  of  the  unit  interval  [0,1]? 

(2)  What  functors  can  be  established  imbedding  the  extensions  or 
"arictions  of  -hem  into  Higg?  It  should  be  mentioned  that  Eytan 
attempted  a  second  time  (unpublished  personal  communications  [57]) 
to  show  that  Fuz  suitably  enlargened  was  a  topos.  (However,  see 
the  construction  of  Fuz  in  section  2.4.2.) 

(3)  Can  we  extend  Fuz  and  Higg  to  reflect  the  generalized  (fuzzy) 
set  theory  established  here?  (This  Is  In  place  of  Zadeh's  system.) 
Of  course,  since  the  mutual  distr ibutivity  property  will  fail  in 
general  for  the  "and"  and  "or"  operations,  we  may  no  longer  have 
classical  categories,  but  a  good  deal  of  structure  remains.  For 
example,  Manes  proposed  a  generalization  of  Fuz  which  is  not 
compatible  with  generalized  set  systems.  Vet,  we  can  show  the 
following  extension  of  Manes'  ideas'.  (See  also  section  7.2.) 
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Define  function  T  :  SET  -»  SET  ,  where  SET  is  the  topos  of 
all  well-formed  sets.  The  object  class  is  indeed  all  sets,  while 
the  arrow  class  is  all  ordinary  functions  between  all  (ordinary) 
sets.  Let  {$  *,  ,  $  )  be  a  fixed  general  logical  system, 

where  the  first  operator  is  realized  as  a  negation,  the  second  as  a 
t-norm,  and  the  third  as  a  t-conora.  Then  for  any  set  X  ,  define 


9"  ( X )  -  the  class  of  all  generalized  subsets  of  X  . 
For  any  fucntion  f  :  X  -»  Y  (thus  in  Ar(X,Y))  ,  define 


s'  (  f )  :  s'  ( X)  -  S'  ( Y)  ,  where  for  any  A  «  s-(X)  ,  ( sr  ( f )  )  (A)  §  f(A)  , 
where  for  any  y  «  Y  ,  the  evaluation  of  f(A)  e  S'(Y)  is  given  by 
^f (A) (Y)  =  "Y  «  f (A) )H 

=  n  (  (3x)  (  (x  e  A)  a  (f  (x)  =  y  )  ) ) ii 


<P 


or 


(<PA(x))  • 


lx€  f-1 (y) ) . 


V 

It  then  follows  immediately  from  the  above  form  that  for  any  f  e  Y 
y 

and  any  g  e  Z  ,  that 

s’  ( g°  f )  *  s  ( g)»T  ( f  )  and  S(identx>  =  ident^^j  , 

which  in  turn  implies  that  s  is  a  functor  from  SET  back  into 
itself  (properly) ! 

Thus,  we  pose  the  following  basic  problem: 

Extend  the  definition  of  s  and  investigate  all  possible 
relevant  properties  of  s  and  relate  this  to  Manes’  "fuzzy 
theories" . 


Issue  #6.  Extensions  of  classical  estimation,  hypotheses  testing 
and  other  decision  porcedures  (Chapter  8). 

Dubois  and  Prade  ([51],  pp.  255-264)  have  devoted  a  section  of 
their  compendium  on  fuzzy  set  theory  to  the  modeling  of  fuzzy  set 
membership  functions.  A  number  of  basic  approaches  are  considered, 
with  some  emphasis  on  the  empirical  aspects  of  the  modeling.  In 
this  vein,  it  should  be  added  that  the  weak  homomorphic  theory 
developed  for  example  in  Chanters  4-6  shows  that  the  idea  of 
ccnputing  the  degree  to  which  a  fixed  value  possesses  a  given  attribute 
may  be  interpreted  as  the  evaluation  of  the  numoer  of  times  (a  fixed 
value  possesses  a  given  attribute ), which  in  turn  is  interpreted  as 
the  evaluation  of  the  corresponding  fuzzy  set  membership  function 
(see  also  section  8.5),  as  is  typically  done  by  survey  sampling  of 
individuals.  This  may  be  Identified  with  the  empirical  one  point 
coverage  probability  function  generated  by  a  random  sample  of  random 
sets  that  are  identically  distributed  and  are  one  point  coverage 
equivalent  to  the  attribute  or  generalized  set  in  question.  How¬ 
ever,  the  following  Issue  has  not  been  sufficiently  emphasized:  In 

classical  statistical  techniques,  modeling  of  distribution  functions 
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is  often  carried  out  in  two  basic  steps.  First,  a  parametrized  fam¬ 
ily  of  distributions  is  chosen.  This  may  be  done  using  invariance, 
shape,  use  of  Central  Limit  Theory,  or  via  trends  of  earlier  empir¬ 
ical  evidence.  The  family  is  chosen  so  that  it  reasonably  contains 
the  viable  alternatives  for  the  true  distribution  and  its  size  is 
adjusted  accordingly.  Then  empirical  data  or  restrictions  are 
imposed  -  such  as  unbiasedness ,  sufficiency,  minimal  risk  with 
respect  to  some  choice  of  loss  function  on  errors  and  estimates  of 
the  unknown  parameter  value  -  yielding  either  a  unique  value  or  a 
reduced  set  of  values  where  the  unknown  parameter  lies.  This  argu¬ 
ment  leads  to  the  conclusion  that  the  same  procedure  should  be 
applied  to  more  general  approaches  to  uncertainty  modeling,  where 
dispersions,  replacing  the  classical  probabilistic  ones,  play  a  key 
role  in  the  theory.  Examples  of  this  are  fuzzy  set  theory,  with  its 
fuzzy  set  membership  functions,  flou  set  theory  with  its  index 
functions  relative  to  the  individual  sets  forming  a  given  flou  set, 
and  topological  neighborhood  theory  with  its  neighborhood  filters. 
(See  Manes'  unified  treatment  of  these  theories  as  described  in 
section  7.2.) 

Other  analogues  can  be  established  between  generalized  set 
theory  as  applied  to  parameter  estimation  and  classical  statistical 
estimation  theory  (see  Chapter  8).  For  example,  one  can  assume  a 
linear  regression  model  is  valid  connecting  observations  with  an 
unknown  parameter  vector,  with  no  specification  of  the  relevant 
distributions  involved  -  at  least,  at  first.  Then  least  squares,  or 
more  generally,  a  least  weighted  functional  defined  on  the  potential 
errors  between  observations  and  possible  values  of  the  parameter  in 
question,  is  derived,  yielding  a  reasonable  value  of  the  unknown 
parameter  as  a  function  of  the  observations,  i.e.,  a  statistic,  if 
distributional  assumptions  were  to  be  made.  Then  if  a  fuzzy  set 
modeling  approach  is  taken,  the  observations  could  be  assumed  to  be 
generated  from  corresponding  generalized  set  membership  functions, 
yielding  in  turn  through  the  standard  extension  of  an  ordinary  func¬ 
tion  (see  section  2.3.7)  the  generalized  set  membership  function  of 
the  "statistic".  In  turn,  this  leads  in  a  natural  way  to  confidence 
sets  for  the  unknown  parameter  vector,  by,  for  example,  considering 
the  level  sets  associated  with  the  aforementioned  membership 
function.  (See  sections  8.2  and  8.3  for  a  related  technique.) 
Asymptotic  properties  of  these  (general  set)  estimators  may  also  be 
obtained  as  the  sample  sizes  are  increased.  (See  section  8.4.) 
Bayesian  techniques  may  also  be  developed  involving  conditional 
distributions  in  the  generalized  set  sense.  (See  sections  2.3.8  and 
8.2  for  development  of  these  concepts  for  generalized  sets.) 

Can  we  develop  a  comprehensive  decision  theory  directly  extend¬ 
ing  classical  probability  and  statistical  theory,  by  utilizing  both 
numerical  and  linguistic  data  sources  through  generalized  set 
theory?  The  beginnings  of  such  a  theory,  where  independence,  con¬ 
ditioning,  Bayesian  forms,  loss  functions,  invariance  techniques, 
etc.,  are  given  in  Chapter  8.  In  addition,  as  mentioned  previously, 
an  extension  if  the  Weak  Law  of  Large  Numbers,  involving  both  dis¬ 
crete  and  a  continuous  infinity  of  operator  arguments,  has  been 
carried  out.  This  impinges  directly  upon  the  combination  of  evi¬ 
dence  problem  (item  #1)  and  is  illustrated  by  the  algorithm 
development  given  in  Chapter  9.  As  mentioned  before,  the  entire 
approach  to  generalizing  classical  set  theory  and  probability  theory 
uses  the  Principle  of  Abstraction:  evaluation  of  the  concept 

through  formal  expression  in  classical  logic  relations,  followed  by 
choice  of  a  particular  logical  cystem.  The  interpretation  is  often 
carried  out  through  an  isomorphism  between  truth  evaluation  and 
logical  operators  and  the  membership  or  dispersion  interpretation  of 
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atomic  predicates  (x  e  A)  .  (A  simple  example  of  this  was  given  in 
issue  #5  for  the  truth  evaluation  of  f(A)  .)  This  approach  stands 
in  contradistinction  to  the  many  brute  force  " fuzzifying"  procedures 
available . 

Finally,  can  f igures-of-merit  representing  the  central  ten- 
dancy,  variances,  etc.,  be  developed  for  generalized  sets? 

In  conjunction  with  the  issues  raised  above,  we  pose  two  re¬ 
lated  questions: 

(a)  What  specific  logical  system  should  be  chosen  for  a  particular 
problem?  As  mentioned,  a  combination  of  theoretical  and  empirical 
guidelines  seems  the  most  appropriate  approach.  The  theoretical 
part  stems  from  the  weak  homomorphic  relations  between  classes  of 
fuzzy  set  systems  -  such  as  the  semi-distributive  or  copula  systems 
-  and  random  set  systems  and  their  operators.  The  empirical  may  be 
based  upon  the  many  studies  of  Zimmermann  [291],  e.g.  ,  but  then 
suitably  combined  with  necessary  parameterizations . 

(b)  Similar  remarks  hold  for  the  modeling  of  generalized  set  mem¬ 
bership  functions. 

Both  problems,  to  a  certain  extent,  imitate  the  situation  in 
classical  statistical  modeling:  How  do  we  choose  the  appropriate 
dispersions  and  what  independence/dependence  assumptions  should  we 
make?  Do  we  use  invariance  or  loss  function  theory  as  a  guideline? 
In  the  case  of  natural  language  modeling.  In  general,  more  emphasis 
must  be  placed  upon  subjectively  obtained  data  and  cognitive  pro¬ 
cesses  . 


Issue  #7 .  Formal  language  representation  of  natural  language 
Information  and  related  problems  (section  2.2). 

Inference  rules  constitute  the  heart  of  any  combination  of 
evidence  algorithm.  Thus  the  faithful  modeling  of  natural  language 
descriptions  is  a  key  concern.  This  entails  the  ability  to  handle 
not  only  simple  declarative  sentences  but  other  modal  forms,  com¬ 
plicated  predicates,  tenses,  clausal  forms,  multiple  argument 
predicate-relations,  and  quantifications  which  may  be  crisp  or 
fuzzy. 

Systematic  symbolization  of  natural  language  is  a  much  sought 
after  goal.  Some  progress  In  this  is  shown  in  the  appendix  at  the 
end  of  this  section  and  the  examples  of  section  2.6.  (See  also  the 
numerous  references  mentioned  throughout  section  2.) 

Consider  first  the  following  questions: 

Can  we  express  all  human  Ideas  or  concepts  in  terms  of  natural 
language?  Can  these  ideas  be  reduced  to  primitives  and  operators 
Involving  them?  Can  a  mathematical/logical  procedure  be  developed 
for  describing  and  analyzing  natural  language  in  a  unified  way,  in¬ 
cluding  attributes  and  operators?  What  quantitative  relationships 
can  be  established  between  prelinguistic  ideas  and  natural  language? 
How  sensitive  or  robust  -  and  how  subjective  -  are  concepts  trans¬ 
lated  into  natural  language  with  respect  to  the  particular  language 
chosen  (the  Whorf-Sapir  hypothesis  is  involved),  the  individual,  and 
the  medium  used?  Is  there  always  inherent  ambiguity  in  modeling  a 
given  concept  in  natural  language?  Can  we  make  use  of  the  enormous 
body  of  literature  available  which  treats  formal  linguistics  and 
semantics  to  develop  a  systematic  unified  framework  directly  rela- 
table  to  multivalued  logic  theory?  In  a  related  manner,  we  may  ask 
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if  a  unified  approach  to  uncertainty  models  (&  la  Manes  -  see 
Chapter  7  for  further  details  on  Manes'  proposed  answer  to  Zadeh's 
fuzzy  set  theory  as  well  as  theories  (and  criticisms)  of  Gaines, 
Schefe,  Watanabe  and  others),  and  to  natural  language  would  be 
possible?  Can  the  efficiency  of  the  various  approaches  to  modeling 
uncertainty  be  meaningfully  compared? 

In  a  more  specific  direction,  it  is  clear  that  we  must  continue 
the  development  of  a  taxonomy  of  language  forms  relative  to  symbol¬ 
izations  as  illustrated  in  the  text.  Thus,  modal  and  temporal  forms 
should  be  treated  by  a  combination  of  •  $5  •  <?>or  >  generalized 

set  system  operators.  These  act  upon  time  and  other  variables  in 
the  form  of  both  indices  and  elements  of  generalized  sets.  Compound 
relations  may  be  best  treated  as  formally  n-ary  generalized  set 
relations.  These  new  results  replace  Zadeh's  well  known  fuzzy 
cardinality  approach.  They  are  based  upon  first  principles  -  only, 
the  generalized  system  chosen }  and  the  resulting  conditioning  of 
predicates  -  rather  than  upon  summation  of  membership  functions  with 
respect  to  their  arguments. 

Much  interest  has  appeared  (personal  communications)  for  this 
new  approach.  In  any  case,  a  comprehensive  procedure  must  be  deve¬ 
loped  for  systematic  symbolization,  extending  Zadeh's  previous  work. 

Finally,  we  ask: 

Can  we  use  formal  linguistic/semantic  metrics  for  determining 
accuracy  of  symbolizations?  For  example,  see  the  well-established 
techniques  of  Osgood,  et  al . ,  [198],  Standard  analysis  of  variance 
techniques  properly  applied  or  factor  analysis  as  used  in  psycho¬ 
metrics,  and  specialized  isolation  of  concept  techniques  such  as 
Nowakowska's  [195']  can  be  pooled.  Or,  tree-diagramming  and  parsing 
techniques  and  checks  could  be  useful  in  comparing  variations  in 
symbolizations  due  to  inherent  ambiguity  of  forms  in  language  or  the 
necessary  nonuniqueness  in  extending  classical  two-valued  logic  to 
infinitely-based  logic  via  the  Principle  of  Abstraction.  (See 
[172].)  As  Chomsky  has  been  often  paraphrased:  the  efficacy  of  a 

linguistic  theory  is  its  test  against  actual  ('Randomly"  chosen) 
linguistic  situations. 


Issue  #8 .  Semantic  evaluations,  logics,  soundness,  completeness, 
consistency  (sections  2.3,  2,4). 

The  problem  of  choosing  the  most  appropriate  logic  as  the 
vehicle  for  modeling  natural  language  information  has  been  mentioned 
already  in  #6.  Do  we  choose  a  truth  functional  system  or  a  proba¬ 
bilistic-like  one,  such  as  Lebesgue-Stielt jes  logic  (2.3.9)?  Truth 
functional  systems  are  easier  to  work  with  but  may  net  reflect 
empirical  conditions.  In  conjunction  with  this,  we  may  pose  the 
following:  can  any  non-truth  functional  system  be  in  some  way 
approximated  by  a  truth  functional  one?  Can  a  logic  be  chosen 
together  with  a  corresponding  general  set  theory  which  is  sound  and 
complete,  or  at  least  consistent,  in  possibly  some  "local"  sense? 
(See  the  various  discussions  throughout  Chapter  2.) 


Issue  #9 .  Knowledge-based  systems 

There  is  a  clear-cut  need  for  a  more  comprehensive  design  of 
knowledge-based  systems  as  developed  here.  As  presented  in  this 
text,  such  systems  are  only  Informally  organized  as  a  feedback-loop 
between  a  panel  of  experts  and  the  chosen  general  logical  system(s). 
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In  addition,  there  is  always  room  for  improvement  in  how  information 
is  extracted  from  experts.  Improved  questionnaire  techniques  and 
psychometric  procedures  (some  of  which  were  mentioned  in  #7)  can  be 
helpful.  In  short,  what  is  needed  is  a  specialization  of  the  gen¬ 
eral  theory  of  systems  -  many  lucid  articles  of  which  can  be  found 
in  the  International  Journal  of  General  Systems  (Great  Britian)  -  to 
knowledge-based  systems  as  interpreted  here.  The  recent  text  of 
Hayes-Roth,  et  al .  [107']  provides  an  excellent  overview  of  know¬ 
ledge  acquisition  and  architecture,  as  well  a  evaluations  and 
syntheses  of  knowledge-based  (and  expert)  systems. 


Issue  #10 .  Miscellaneous  conents  on  uncertainty  modeling 

In  classical  statistical  decision  theory,  for  finite  sample 
sizes,  non-zero  lower  bounds  -  of  the  Rao-Cramer-Fisher  type  [212]  - 
exist  for  the  second  moment  matrix  of  parameter  estimator  errors 
under  sufficient  regularity  conditions.  In  conjunction  with  this, 
at  the  quantum  level,  the  well-known  Heisenberg  uncertainty  prin¬ 
ciple  furnishes  lower  bounds  on  simultaneous  variances  of  position 
and  momentum  measurements.  General  dispersion  analogues  to  the 
above  precision  (or  equivalently,  entropy)  bounds  are  sought. 
However,  as  pointed  out  in  issue  #6,  development  of  measures  of 
central  tendency  and  higher  order  moment  analogues  for  dispersions 
remain  to  be  fully  developed,  and  so  all  subsidiary  investigations, 
including  the  above  for  precision  bounds,  must  be  held  in  abeyance. 

In  another  direction,  the  role  of  idempotence  and  flxea  point 
theorems  in  uncertainty  modeling  should  be  pointed  out.  (We  have 
already  considered  idempotent  t-norms  and  t-conorms  in  section 
2.3.6,  )  Such  mathematical  tools  have  been  well-developed  in  both 
existence  forms  (such  as  generalizations  of  Schauder's  fixed  point 
theorem  for  continuous  functions  over  convex  subsets  of  Banach 
spaces)  and  construction  forms  (being  essentially  an  eigenvector- 
eigenvalue  problem  for  unit  eigenvalues) . 

In  classical  estimation,  under  reasonable  conditions,  it  can  be 
shown  directly  that  if  X  and  Y  are  random  vectors  representing 
an  unknown  (state)  parameter  vector  and  observed  data,  respectively, 

L  is  an  estimation  loss  function  and  the  Bayes  estimators  £(Y)  is 
unique,  etc.,  then  the  idemootence  relation 

S(Y)  *  £(S( Y) ) 

holds . 

For  any  feedback-loop  system  -  such  as  a  knowledge-based  system 
amenable  to  design  changes  via  a  panel  of  experts  -  we  compare 

y  and  y  _  =  r  (x  (y  ,6  _),  $  _) 
t  t  t  t  t 

where  subscript  t  indicates  present  time,  t_  ,  indicates  predic¬ 
ted  to  t  ,  y  indicates  observed  or  potentially  observable  data, 
x  unknown  parameter  vector  and  R  and  S  design  matrices,  which 
include  error  covariance  matrices,  transition  matrices,  and  measure¬ 
ment  matrices  as  components,  in  the  case  of  Kalman  filtering. 
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If  the  system  reaches  a  steady  state  level,  i.e.,  y  and  y 

t  t~ 

approach  a  common  constant  value  with  &  _  ,  6  _  ,  §  all  ap- 

t  t  t 

proaching  constants  as  t  -»  «  ,  then  the  relation  becomes  approx¬ 

imately  the  following  fixed  point  one: 

Y.  -  g(Y«:  Q„>  • 


where 


qj  -  *  $  > 


Note  also  the  widespread  use  of  fixed  point  theorems  in  ob¬ 
taining  various  properties  about  logics  such  as  in  Rescher  [214], 
Chapter  III,  where  a  modified  form  of  the  Axiom  of  Comprehension  is 


shown  to  be  model  consistent  with  a  variation  of  logic  L 


On 


the  other  hand,  more  constructive  aspects  of  fixed  point  theory  can 
be  utilized  in  the  modeling  and  semantic  evaluation  of  reflexive  or 
self-referencing  sentences.  Interestingly  enough,  as  Bellman  points 
out  in  Chapter  9  of  his  book  on  AI  [13* ],  such  sentences  are  often 
the  sources  of  humor,  such  as  the  statement  telling  us  to  plan 
ahead,  which  itself  is  not  well-planned  relative  +o  page  spacing  or 
the  one  proclaiming  that  no  one  should  use  profane  language,  des¬ 
cribed  through  the  use  of  such  language. 

Consider  the  following  examples  of  reflexive  statements: 


51  =  "This  sentence  is  true." 

52  =  "This  sentence  is  false." 

S  =  "This  sentence  is  q-true." 

O 


(q-true  is  some  linguistic  quantifier  such  as  "mostly", 
"approximately  3/4",  etc.) 

-  "This  sentence  contains  essentially  gray  colored 
characters . " 

S,.  =  "This  sentence  contains  mostly  roundish  shaped  letters." 
Sg  =  "This  sentence  cannot  be  deduced  within  classical  logic 
and  the  theory  of  arithmetic." 

If  we  apply  the  general  principles  developed  within  the  text, 
it  follows  that: 


Sa  =  ClISjll  €  (i)>  , 

"Sl"  =  <t>{l)(IISl"  )  5  U> 

s2  =  <11  S 2 II  e  { 0 }  >  , 

"V  =  *<0}("S2"  }  ' 


(2) 
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and  more  generally, 


S3  =  <IIS3H  €  q>  ' 

iiS3it  =  $  (MSgli  )  .  (3) 

Clearly,  'f  {1}  represents  the  crisp  set  singleton  1,  then  a 
natural  solution  for  the  semantic  evaluation  In  eq.  (1)  is 

HSji:  =  1  . 


However,  if  {0}  represents  the  crisp  set  singleton  0,  eq.  (2) 
has  no  solution  for  IIS^H  and  leads  to  the  famous  "Liar's  paradox". 

But,  if  {0}  is  replaced  by  a  fuzzy  number,  i.e.,  ${0}  is  a 

monotone  continuous  decreasing  function  around  0  with  <}>  (0)  =  1 

and  <J>  ^  q  j  ( c )  =  0  for  some  minimally  chosen  c  ,  0  <  c  s  l  ,  then 

eq.  (1)  doos  have  (a  fixed  point)  solution  (as.  is  easily  seen 
graphically) ,  which  approaches  0  as  c  approaches  0  . 

In  general,  eq.  (3)  will  have  at  least  one  (if  is  non¬ 

trivial  symmetric  unimodal,  normalized  at  some  xQ  e  (0,1)  ,  then  in 
general,  two)  solution(s)  for  itS.ii  . 

J 

On  the  other  hand,  S4  is  more  difficult  of  evaluate,  since 


S4  =  <color(S4)  e  Gray>  , 
"V  *  <*Gray<Color(S4)  )  ' 


(4) 


with  iis4li  not  appearing  on  the  right  hand  side  of  eq.  (4)  . 

Similar  remarks  hold  for  Sg  and  the  Gbdelian-like  sentence 
S6  '  Satisfactory  analysis  of  these  forms  analogous  to  the  first 
three  examples  is  yet  to  be  worked  out. 
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APPENDIX 


Modeling  Natural  Language  and  Semantic  Evaluations 


A  good  deal  of  this  appendix  is  based  upon  the  paper  [91"]. 


INTRODUCTION 

A  procedure  for  combining  evidence  based  upon  linguistic  and 
numerical/statistical  sources  is  presented  in  Chapter  8.  A  more 
specific  application  to  tracking  and  target  data  association  pro¬ 
blems  is  given  in  Chapter  9 .  The  center  of  these  approaches 
consists  of  inference  rules  and  error  tables  (and  matching  tables 
used  in  the  inference  rules)  modeled  within  a  framework  of  multiple¬ 
valued  logic.  It  is  clear  that  the  widest  possible  scope  in 
modeling  and  evaluating  natural  language  information  must  be  sought 
in  order  to  be  able  to  carry  out  efficient  programs  of  combining 
evidence.  Natural  language  sentences  are  often  complex  in  form. 
For  example,  the  following  sentences  could  occur  within  a  military 
context : 

* 

S  =■  In  Ocean  region  V  ,  and  usually  in  Region  W  ,  if  the 
weather  is  poor  and  the  sea  state  corresponds  to  rela¬ 
tively  high  turbulance,  then  indications  by  sensor 
system  A  that  a  submarine  was  in  the  area  are  not  that 
reliable  and  probably  should  be  discounted  in  favor  of 
geolocation  matching  information  obtained  from  sensor 
system  B  ,  although  exceptions  to  this  can  occur  when 
visibility  is  up  to  about  two  miles,  in  which  case,  it 
has  been  shown  that  A-data  matching  should  be  assigned  a 
much  higher  degree  of  importance  in  Its  effect  upon 
correlation. " 

*  * 

S  =  "Contact  with  the  ship  was  held  for  about  two  hours,  but 
was  lost  just  before  the  Straights  of  Skagerrak  were 
sighted,  although  purple  side-insignias  may  have  been 
spotted  as  well  as  an  oval-shaped  dome  near  the  rear  of 
the  ship,  but  a  foggy  condition  prevaded  the  area  pre¬ 
venting  any  further  Identification." 

Such  examples  as  above  illustrate  the  typical  problems  faced  in 
modeling  and  symbolizing  natural  language.  This  includes  the  inter¬ 
pretation  of  modifiers  such  as  "usually",  "relatively  high",  "not 
that  reliable",  modal  and  temporal  operators  such  as  "probably 
should  be",  "was  held  for  about  two  hours",  and  verb/predicative 

relations  such  as  "Contact  with  the  ship",  "was  lost",  "foggy 

condition  prevaded  the  area".  (See  [46’'’],  [46iv]  for  related 

linguistic  problems  arising  in  expert  systems.) 

A  systematic  approach  to  the  full  symbolization  of  language  is 
thus  most  desirable.  In  this  text,  some  modest  efforts  in  this 

direction  are  made.  Conditional  expressions,  such  as  "most  tall 
ships  in  region  5  are  enemy  ones  of  type  F"  are  considered.  The 

approach  here  is  in  contrast  to  Zadeh's  rather  arbitrary  "fuzzy 

cardinality"  approach  which  cannot  be  directly  derived  from  multi¬ 
valued  truth  considerations  [286],  [289].  A  comprehensive  approach 
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is  taken  to  the  modeling  of  temporal  and  modal  relations,  extending 
earlier  ideas  of  Zadeh's  PRUF  technique  [286].  Some  examples  are 
presented  illustrating  these  ideas  with  an  important  application  to 
the  combination  of  evidence  procedure.  A  number  of  other  modeling 
procedures,  in  addition  to  those  presented  here,  is  given  in  section 
2.6. 


NATURAL  LANGUAGE,  FORMAL  LANGUAGE,  AND  SEMANTIC  EVALUATIONS 

Too  often  in  the  past,  natural  language  information  was  ne¬ 
glected  in  favor  of  "more  precise"  numerical  data.  Or,  such 
information  at  times  was  arbitrarily  made  more  precise  to  be  in 
numerical  form.  Since  the  onset  of  Chomsky  and  others,  more  rigo¬ 
rous  outlooks  have  been  taken  toward  the  understanding  and  modeling 
of  information  content  in  language  [163],  [47].  With  the  work  of 
Zadeh  on  PRUF  [286]  began  a  new  era  in  the  development  of  a  calculus 
for  semantic  evaluation  of  natural  language.  The  following  basic 
premises  are  assumed: 

(a)  All  natural  language  information  is  translatable  into 
sequences  of  English  sentences.  The  problem  of  whether  a  given 
natural  language  molds  the  speaker's  thoughts  due  to  its  structure 
and  limitations  -  the  Whorf-Sapir  hypothesis,  or  whether  this  is  not 
valid  as  Berlin  and  Kay  claim  (see  [153]  for  comments)  -  will  not  be 
dealt  with  here. 

(b)  Ambiguity  or  meaning  is  expressed  by  (subjectively) 
weighting  the  possibility  of  interpretations.  Thus,  e.g.,  the 
expression  "I  like  her  well."  could  be 

Sj  »  "I  really  like  her." 

52  *  "I  wish  she  remains  well  (in  good  health)." 

53  *  "I  want  her  to  become  well." 

54  =  "I  like  the  well  that  she  owns." 

Weight  w^  ,  could  be  assigned  to  ,  i  *  1 ,  .  .  .  , 4  .  Usually 

context  allows  for  resolution  of  these  possible  branches  of  meaning. 
For  simplicity.  It  will  be  assumed  here  no  ambiguity  is  present. 
(See  [153]  for  further  discussion.  See  also  the  related  idea  of 
probabilistic  grammars  [249"]  and  Oden's  fuzzy  set  approach  [196], 
[196-  ] .  ) 

(c)  Any  given  sentence  in  actuality  represents  an  equivalence 
class  of  possibly  differently  appearing  -  i.e.,  syntactically 
different  -  sentences,  all  having  the  same  semantic  evaluation,  a 
number  lying  in  the  unit  interval  [0,1]  representing  its  truth 
value.  This  is  related  to  Chomsky's  concepts  of  transformational 
generative  grammar,  where  changes  in  forms  of  sentences  are  due  to 
word  order  rearranged,  use  of  synonyms,  change  of  voice  from  active 
to  passive,  or  other  superficialities  [163],  [47]  . 

(d)  Parsing  Principle:  Given  any  sentence  (or  any  equivalence 
class  of  sentences)  there  exists  an  analytic  form  or  parsing  which 
is  semantically  the  same  but  is  structured  within  a  formal  language. 
This  is  related  to  Chomsky's  deep  structure  analysis  [163],  [47). 
(For  further  details  on  formal  language  and  multivalued  logic,  see 
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[214],  [172].)  Attempts  at  establishing  automatic  procedures  for 
parsing  natural  language  into  a  corresponding  formal  language  form, 
such  as  Schank's  approach  [223]  are  many  and  the  area  remains  a 
lively  one  for  research.  (See  the  large  compendium  of  approaches  in 
[274-]  • ) 

A  typical  parsing  analysis  yields,  for  any  given  compound 
sentence  S 

S  =  comb( . . . ,not,&,or,  if{  )  ,  then, . . . ) (Sj.Sg, . . . ,Sr)  (1) 


where  the  operators  "not",  "or",  etc.,  all  indicate  the  usual 

unary  or  binary  linguistic  connectors  and  comb  indicates  some 
sequential  combination  of  these  connectors  with  sentences 
S1,S2,...,Sr  ,  the  latter  all  having  simpler  forms  than  does. 

In  turn,  each  also  has  a  parsed  form  in  terms  of  relatively 

simpler  sentences,  etc. 

(e)  Modified  Principle  of  Abstraction:  Any  sufficiently  simple 
sentence,  such  as  the  components  in  (1),  has  a  unique 

corresponding  semantically  equivalent  form 


(x  «  A) 


(2) 


where  A  represents  a  generalized  set,  property,  or  attribute,  x 
is  a  possible  vector  of  elements  in  the  ordinary  sense,  and  e  is 
the  extended  set  membership  relation  for  generalized  sets.  (See 
2.2.)  As  in  ordinary  set  relations,  A  is  considered  a  subset  of 
an  ordinary  set  X  called  the  universe  of  discourse  or  base  space 
and  in  the  ordinary  sense,  x  is  in  X  ,  i.e.,  x  «=  X  .  It  should 
be  noted  that  this  apparently  reasonable  principle  can  lead  to  para¬ 
doxes  in  formal  logical  systems,  such  as  in  classical  naive  set 
theory  or  even  in  set  theory  based  on  multiple-valued  logic,  for  a 
wide  variety  of  logics  (except  for  Lukas i ewi cz  -n  j  Logic  -  see 

section  2.3.3.).  In  the  work  here,  these  difficulties  will  be 
Ignored  for  the  time  being. 

Thus,  (1)  and  (2)  yield  for  sentences  S 

S  =  comb( . . . , not , & , or , . . . ) (Xj  e  AJf  x2  e  A2,...,xr  e  Ar) 


=  (x  «  A)  ,  (3) 

where 

x  *  (xx,x2 . xr)  (4) 


A 


comb •(...,  C  ,x 


,=*,...)(A^,A2,...,A^)  , 


(5) 


where  comb*  is  some  other  combination  function  and  C  is  the  com¬ 
plement  operator  on  generalized  sets,  corresponding  to  "not",  x  is 
the  cartesian  product  operator  corresponding  to  ,  r  is  the  car¬ 
tesian  sum  operator  corresponding  to  "or"  ,  etc.  (See  also  2.3.8.) 


(f)  Principle  of  Semantic  Evaluation:  Any  sentence  S  has  a 
truth  value  iisii  ,  a  number  in  [0,1]  which  can  be  evaluated 
through  the  values  of  the  semantic  function  it  >  it  over  component 
parts  of  S  ,  given  the  particular  semantic  function,  or  equiva¬ 
lently,  logic  chosen  (section  2.3). 
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(i)  If  the  semantic  function  is  truth  functional,  then  eq.  (3) 
is  evaluated  as 


HSH  =  comb  (...,  <t>nt ,  ,  <t>or ,  ^  <I>A  (x1)  ,  .  .  .  ,$A  (xr)) 


(6) 


where 


4>a  (x1)  =  nx1  «  Ajii  ,  i  =  1 ,  .  .  .  , r  (7) 

yielding  in  general  the  membership  or  possibility  function 
(j>  :  X.  -*  [0,1]  ,  and  where 

Ai  1 

<|>nt  *  "not II  :  [0,1]  -  [0,1]  (8) 

is  a  nonincreasing  function  with  c}>nt(0)  =  1  and  <f>nt(l)  =  0  '  the 
classical  truth  table  relations,  and  similarly, 

=  ii&ii  :  [0.1]  x  [0,1]  -  [0,1]  (9) 

is  a  nondecreasing  function  usually  assumed  to  be  bounded  above 
pointwise  by  the  function  min,  continuous,  symm. ,  associative  -  so 
that  it  is  unambiguously  extendable  recursively  to  any  finite  number 
of  arguments  -  and  has  the  boundary  truth  table  values  <|»  ( 0 , y )  =  0  , 

Ot 

4>&(l,y)  =•  y  ,  for  all  y  in  [0,1]  .  An  analagous  form  holds  for 

4>or  -  "orll  :  [0,1]  x  [0,1]  -  [0,1]  ,  (10) 

nondecreasing,  etc.,  bounded  below  pointwise  by  max,  and  having 
boundary  truth  table  values  <|>or(0,y)  =  y,  <|>or(l.y)  =  1  »  for  all  y 

in  [0,1]  . 

The  above  functions  are  called  negations  (with  often  the  added 
property  of  being  an  involution),  t-norms ,  and  t-conorms,  respec¬ 
tively.  (See  section  2.3.6  for  various  properties  of  these 
operators . ) 

(ii)  If  il  •  il  is  not  truth  functional,  then  the  evaluation  in 
eq.  (6)  does  not  hold  and  a  more  complicated  evaluation  procedure  is 
valid.  One  example  of  this  is  Probability  Logic  (section  2.3.9), 
where ,  e . g . , 

iiSj  or  S2it  =  il  Sa ll  +  llSjll  -  HS1  &  S2li  ,  (11) 

il  not  SH  *  1  -  HSU  ,  (12) 

but  in  general  there  is  no  fixed  <f>&  ,  not  dependent  on  any  parti¬ 
cular  S1  or  S2  such  that 

II  S  j  &  S  2 II  -  «J)a(llS1ll  ,  II  S  2  H  )  , 
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where  S^S^S  are  any  sentences.  (See  [214]  and  section  2.3.3 

for  further  discussions  concerning  truth  functional  vs.  non-truth 
functional  evaluations.) 

In  addition,  for  a  given  set  of  natural  language  connectors, 
more  than  one  semantic  evaluation  function  may  be  used  throughout  a 
given  sentence  or  in  certian  different  sentences. 


SOME  LANGUAGE  OPERATORS  AND  RELATIONS 

In  this  section,  some  common  (but  by  no  means  exhaustive) 
language  operators  and  relations  are  considered. 

(a)  Linquistic/loqical  connectors. 

The  basic  connectors  representing  negation  (not),  conjunction 
(&) ,  disjunction  (or),  implication  (if(  )  then  (  ))  ,  have  already 
been  introduced.  The  last  could  also  be  defined,  as  in  the  classi¬ 
cal  logic  case,  in  terms  of  "not"  and  "or".  More  compound  operators 
such  as  "iff"  may  also  be  defined.  Purely  linguistic  connectors 
such  as  "although  "  and  "but"  can  be  defined  entirely  in  terms  of 
the  basic  connectors  also.  For  example,  "although"  may  be  iden¬ 
tified  with  implication  and  "but"  with  conjunction,  with  some 
possible  modifications. 

(b)  Hedges. 

Hedges  are  intensifiers  or  modifiers  operating  on  attributes. 
If  one  lets  "hedge"  represent  generically  any  hedge,  such  as 
"extremely",  "very",  "little",  "quite",  then  any  choice  of  semantic 
evaluation  function  11*11  leads  to  the  function  'f’hedge  =  llhed9en 

[0,1]  -♦  [0,1]  .  Some  controversy  exists  concerning  how  to  generate 
spectra  of  hedges  from  a  neutral  hedge,  where  exponentiation  and 
translation  parameter  families  have  been  compared  empirically  as 
candidates  [150],  [147],  [167].  An  alternative,  and  perhaps  more 

general,  approach  is  to  consider  first  the  simple  hedges 
corresponding  to  Integral  iterations  of  conjunction.  Thus,  for  any 
positive  integer  j  ,  and  any  sentence  S  =  <x  e  A>  ;  x  and  A  are 
as  before : 

S(J)  =  "x  has  property  A  to  the  jth  intensity" 

=  "x  has  property  A ^  ^ " 

=  (x  €  A( ^ )  .  (13) 

In  turn,  assuming  truth  functionality  here, 

»S{  j)H  =  'VV*) . VX)  }  •  (14) 

However,  for  the  choice  =  min,  no  change  in  semantic  value  for 

the  jth  intensity  is  reflected  here!  On  the  other  hand,  if  *  is 

& 

an  Archimedean  t-norm  such  as  prod  (i.e.,  ordinary  product  with 
respect  to  its  arguments)  then  it  follows  from  the  canonical  repre¬ 
sentation  (see  2.3.6)  that  there  exists  a  continuous  monotone 

decreasing  function  h  :  [0,1]  -«  R+  with  h(0)  z  -h»  and  h(l)  =  0 
such  that 
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$&(x1,...xr)  =  h_1 (min(h(x1)+. . .+h(xr) ,  h(0)))  .  (15) 

for  all  x^  in  [0,1]  ,  i  =  ,  n  arbitrary  positive. 

(Conversely,  any  choice  of  such  an  h  generates  an  Archimedean  <J) 

as  in  (15),  where  one  need  only  take  n  =  2.)  It  follows  immedi¬ 
ately  that  (15)  implies  that  j  in  eq.  (14)  may  be  replaced  by  any 
positive  real  number  so  that  (14)  becomes 

iiSlJ,n  =  h-1(min(  j-h(<t>A(x)  )  ,  h(  0 )  )  )  .  (16) 

Anaiagous  forms  may  be  obtained  relative  to  <t>or  ,  and  negation  as 

well  may  be  employed.  In  (16),  when  j  >  1  ,  S^o^  can  be  called 

an  intensification,  where  somewhat  arbitrarily,  one  denotes 

"very(S)"  as  S^o^,  "very  very(S!"  as  "very(very(a) ; 1  =  S^2^o^  , 

etc.  When  j  <  1  ,  S^l^  similarly  can  be  identified  with 

"little  of  (S)",  etc.  Some  tie-ins  between  hedges  and  quantifiers 
will  be  discussed  in  subsection  (e). 

(c)  Modal  operators ■  (See  [245]  for  background.) 

Alethic  modality  concerns  itself  with  the  spectrum  -  together 
with  negations  -  of  indicativeness.  Thus  for  example:  "impossible," 
"improbable",  "possible",  "is",  "likely",  "probable",  "certain"  is 
one  such  collection.  Indeed,  correspondences  have  been  established 
between  a  simple  numerical  scale  of  subjective  confidences  between 
0  and  1  and  such  alethic  forms  for  a  number  of  applications 
(personal  observations).  Using  negation,  the  operators  of  necessity 
and  entailraent,  among  others  ,  may  be  defined  [245].  Deontic  modal¬ 
ity  concerns,  analagously,  the  spectrum  of  permission  or  obligation. 
Other  modal  families  of  operators  may  concern  hope,  desire,  hate, 
etc . 


In  any  case,  a  reasonable  way  to  generate  such  families  or 
spectra  of  modal  operators  is  to  choose  some  base  or  anchor  within  a 
given  family,  denoted  as  modalQ  ,  say,  and  simply  define 

modal  =  hedge(modalQ)  (17) 

where  hedge  is  some  suitably  chosen  modifier,  as  in  subsection  (c  , 
depending  of  course  on  modal.  Hence, 

S  *  modal (x  «  A) 

=  ( x  «  modal ( A) ) 

*  (x  «  hedge(modalQ{A) ) )  ,  (18) 

with  semantic  evaluation,  assuming  truth  functionality, 

115,1  =  Vdal<Vx>> 

^hedge^modalo^A*x^  ^  '  (19) 
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(d)  Temporal  Operators.  (See  [215]  for  the  related  area  of 
temporal  logics.) 

Consider  first  the  case  for  past  time  operators  and  in  parti¬ 
cular  the  expression 

S  =  "y  had  property  A"  .  (20) 

Suppose  that  A  is  a  generalized  subset  of  domain  X  =  Y  x  R 

where  Y  is  some  fixed  population  (an  ordinary  set)  and  R  ,  the 
negative  real  with  zero,  represents  the  flow  of  time,  with  the 
present  being  identified  with  t  =  0  .  It  is  also  supposed  that 

$A  :  X  -*  [0,1]  is  known.  Thus  for  any  t  «  R  ,  the  sentence 

S  =  "y  had  property  A  at  time  t"  (21) 

has  the  semantic  evaluation 

»St»  =  <t>A ( y ,  t )  ,  (22) 

for  any  ,  «  Y  .  Next,  identify  "was"  as  a  generalized  subset  of 
R  ,  so  that 


II was II  -  $was  :  R~  [0,1]  (23) 

is  some  monotonically  decreasing  function  with  <t)Was*-a>*  =  1  and 

^was'0^  =  0  •  At  this  point  it  should  be  remarked  that  empirical 

investigations  have  to  be  made  to  determine  what  the  actual  member¬ 
ship  functions  involved  in  this  modeling  -  and  all  previously 
mentioned  models  -  are  numerically.  Putting  together  eqs .  (21)  - 

(23)  yields  the  reasonable  interpretation  for  eq.  (20)  : 

S  =  Or  (Sj.  &  <t  €  was>)  ,  (24) 

fover  all] 

It  in  R  J 

which  under  the  usual  trutn  functionality  assumptions  yields 

"s"  =  <t>or  (4>- (<fA{y.  t)  .  $was ( t )  )  >  •  (25) 

("over  all] 

It  in  R  J 

Note  also,  that  in  practice,  R  will  be  replaced  by  a  suitable 
discretization,  unless  $  =  max  is  chosen.  (The  problem  of 

extending  t-norms  and  t-conorms  to  a  continuum  of  arguments  is 
discussed  in  section  8.4.  A  related  result  may  also  be  found  in 
[91'''].  section  5.)  Similar  analysis  can  be  carried  out  for  remote 
past,  future,  future  anterior,  and  many  other  temporal  relations. 

(e)  Conditioning  and  quantification. 

Zadeh's  contributions  to  this  area  have  already  been  mentioned 
(  286).  See  also  the  discussion  of  other  approaches  in  [51],  pp . 
138-140.  The  approach  presented  here  is  quite  general  and  reduces 
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to  Zadeh's  and  others  for  particular  evaluations.  Let  A  be  a 
generalized  subset  of  base  space  y  and  B  a  generalized  subset  of 
Y  .  Let  quant  stand  for  any  quantification  involving  percentages 
such  as  "some",  "all”,  "few”,  many",  "sometimes",  "often",  "most", 
"about  3.4",  "0.456",  etc.  Let  pop  be  a  fixed  population  of 
individuals  (an  ordinary  finite  set)  and  suppose  that  measurement 
functions  f  :  pop  -*  X  and  g  :  pop  -♦  Y  are  given  so  that  for  any 
z  in  pop  , 

(z  e  A)  =  "z  has  attribute  A"  =  "f(z)  has  attribute  A" 
and 

(z  «  B)  =  z  has  attribute  B"  =  "g(z)  has  attribute  B".  (26) 

Furthermore,  pop  can  be  considered  to  be  an  element  -  in  the  ordin¬ 
ary  sense  -  of  a  super-universal  set  Pop  ,  the  collection  of  all 
populations  of  possible  interest.  In  turn,  A  and  B  may  also  be 
considered  generalized  subsets  of  Pop  ,  so  that  for  any  member  of 


Pop 

of 

,  such 
pop  in 

as 

A 

pop  ,  one 
and  in  B 

can  define 

as 

in  a  reasonable  way  membership 

(pop  e 

A) 

=  Or 

( (Z  e  A)  & 

(Z  €  wt) ) 

over  alii 
z  «  pop  J 


and 

(pop  «  R )  =  Or  ( z  «  B)  &  (z«  wt ) )  ,  (27) 

(over  all] 
lz  «  pop  J 

where  wt  is  some  generalized  subset  of  pop  representing  weighting 
of  importance  of  each  individual  for  either  attribute  A  or  B  (as¬ 
suming  here  for  simplicity  that  wt  is  the  same  for  both  attributes) . 
If  equally  likely  weighting  is  desired,  $>wt  s  l/card(pop)  .  Thus, 

under  the  usual  truth  functionality  assumptions,  it  follows  that 

*A(pop)  -  "(pope  A)  ll  =  <$>or  <V*A(f(z))'  <f>wt(z)))  (28) 

(over  alii 
lz  «  pop  J 

with  a  similar  expression  holding  for  <PB(pop)  ,  where  f  is  re¬ 
placed  by  g  .  Similarly,  the  evaluation  of  t.^afpop)  is  given, 

rii  1 D 

if  no  interaction  is  assumed  between  A  and  B  (see  [51]  for 

further  details)  as: 

<?>AnB(pop)  =  11  (P°P  €  APB )  ll 

“  $or  ( *& ( ^ (  f  (z)  )  ’  V9(z,)'<f,wt(z)))  ‘  (29) 

(over  alii 
lx  «  pop  J 

The  sentences 

=  "individuals  have  A  ,  given  individuals  have  B" 

"  (pop  «  A  |  pop  e  B)  (30) 
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and 


S2  =  "If  individuals  have  B  they  also  have  A" 

=  "If  (pop  e  B)  then  (pop  e  A)" 

=  (  ( pop ,  pop )  e  (B  =*  A ))  =  (  ( pop  €  B )  »  ( pop  «  A )  )  (31) 

are  slight  variations  of  each  other.  The  first  is  an  example  of 
conditioning,  where  here  conditioning  is  defined  as  in  section 
2.3.8.  Thus  the  semantic  evaluation  HS^i  satisfies  the  relation 


^AnB^pop^  =  MS1  &  (pop  €  B)H 
=  ^O'S^l  •  $B(P°P)  ) 


(32) 


The  second  is  evaluated,  as  before,  as 


HS2il  =  ^((^(pop)  ,^A(pop)  )  ,  (33) 

under  the  usual  assumptions.  Then,  a  sentence  such  as  "Most  ships 
that  have  long  hulls  also  have  maneuvering  problems"  may  be 
expressed  in  the  general  form 

S3  =  quant (Sj)  or  S4  =  quant (S2)  ,  (34) 

leading  directly  to  the  evaluations  (under  truth  functionality 
assumptions) 


"V 

and 

IIS  .11 

4 

where 


•quant 


<t> 


quant 


(»s1n ) 

(»S2li)  , 


(35) 


^ quant  *  11  quant «  :  [0,1]  -*  [0,1] 


(36) 


is  obtained  beforehand.  For  example,  if>r 


is  conveniently  modeled 


as  a  unimodal  normalized  function  about  3/4  ,  while  <p 


is  a  non- 


decreasing  function,  being  zero  over  [0,1/2]  and  then  becoming 
monotone  increasing  over  [1/2,1]  ,  where  =>  "about  3/4"  and 


Q2  »  "most". 


Zadeh's  fuzzy  cardinality  approach  to  quantification  Is  ob¬ 
tained  by  choosing  =  prod  ,  $wt(z)  =  l/card(pop)  ,  for  all  z 

in  pop  ,  and  by  choosing  =  bndsura  (i.e.,  for  any  vi',','vn  *n 

[0,1]  ,  bndsum(v  , . . . ,v  )  =  min( 1 ,v1  +  . . . +v  ) )  : 

3  ( 4*.  (  f  ( z ) )  •  <t»„  ( g  ( z ) ) )  /  5  ( 

fall  zl  A  fall  z] 

lin  pop J  lin  popj 


<fB(3(z))>)  • 


HSJI  ■  <t>  A 
3  T quant ' 


(37) 
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Finally,  it  should  be  noted  that  ambiguity  arises  in  the  model¬ 
ing  of  exact  quantifiers.  For  example,  "all"  can  be  approached  as 
above  through  the  function  “  a.  i  (Kronecker  delta  function 

for  1  )  or  it  can  be  modeled  by  the  hedge  corresponding  to  the 

operation  S ^  ^  for  any  sentence  S  ,  where  here  j  a  card(pop)  , 
i  .e.  , 

"all  z's  have  A  "  =  &  (<$>,(f(z)))  .  (38) 

fover  all") 

Lz  in  popj 

If  "softening"  is  really  intended  as  in  "about  5/7"  for  "5/7", 
"almost  all"  for  "all",  "a  few"  or  "there  is",  etc,,  then  the  ap¬ 
proach  given  in  this  subsection  is  most  appropriate.  Conversely,  if 
an  exact  cardinality  is  specified  as  in  "at  least  2"  and  is  meant 
literally,  then  combinatoric  considerations  have  to  be  made: 

"At  least  two  s's  in  the -population  which  have  B  , 
have  also  A" 

=  Or  ( ( z 1  «  A ) & ( z "  e  A)|(z'  €  B)&(z"  e  B ) )  . 

(over  all  z^z""} 

(.in  pop,  z'j^z"  J 

(39) 


( f )  Verb  and  predicative  relations. 

Three  different  approaches  to  the  modeling  of  such  relations 
are  presented  here. 

(1)  The  relations  may  be  defined  operationally  -  i.e.,  only 
directly  through  a  membership  function.  For  example,  the  binary 
relation  "runs  to"  as  in  "John  runs  to  the  store"  can  be  defined 
over  the  domain  X  =  Y  x  Z  ,  where  Y  is  some  relevant  human 
population  and  Z  is  a  collection  of  possible  objects  of  the  verb 
"run  to"  . 

(ii)  The  relations  may  be  defined  indirectly  through  the  use 
of  measurement  functions,  as  introduced  earlier.  Thus  "gross", 
"fat",  "small",  depending  of  course  on  the  context,  can  be  directly 

defined  on  the  domain  R+  x  R+  after  introducing  the  natural  mea¬ 
surement  functions  f  :  pop  -»  ,  g  :  pop  -»  R+  ,  representing 
height  in  inches  and  weight  in  pounds,  respectively. 

(iii)  The  relations  may  be  analyzed  further,  analogous  to  a 
dictionary  definition  of  a  relatively  compound  concept  in  terms  of 
more  primitive  ones.  In  turn,  these  relations  could  be  used  to  form 
constraints  between  the  components,  which  would  be  then  modeled. 
The  usefulness  of  this  approach  remains  to  be  established. 


EXAMPLES  ILLUSTRATING  SOME  OF  THE  PRINCIPLES 

The  above  stated  principles  serve  as  guidelines  in  the  modeling 
of  natural  language  information.  In  practice,  much  ingenuity  must 
be  exercised  (in  a  sense,  this  is  an  art,  based  upon  intuition)  in 
properly  capturing  the  essence  of  the  meaning  of  a  given  sentence. 
Such  will  continue  to  be  the  case  until  a  universal  parsing  pro- 


Summary  of  Main  Resalts  and  Open  Questions 


589 


cedure  is  discovered  (see  the  comments  earlier  in  the  previous 
subsections ) ! 

teyrole  1. 


* 

Consider  the  compound  sentence  S  in  the  Introduction: 

Let:  pop^  =  set  of  all  days  of  interest  ,  (40) 

pop2  =  set  of  all  ocean  regions  of  interest 

=  {V,W, . . . )  ,  (41 ) 

pop3  *  set  of  all  submarines  of  interest,  (42) 


X 


( range  of 
x  ( range 
x  ( range 
x  ( range 
x  ( range 

s  R  x 


possible  temperatures  in  degrees) 
of  possible  wind  velocities  in  m.p.h.) 
of  Vs  possible  representing  cloudiness,  etc.) 
of  possible  no.  of  representing  precip.  inten.) 
of  average  maximal  visibility  in  miles) 

x  [0,1]  x  R+  x  R+,  representing  weather  measurements. 


(43) 


Y  = 

£ 


( range  of 

+  + 

R  X  F  . 


wave-chop  heights )x (range  of  max. water  vel . ) 
representing  sea  state  conditions. 


(44) 


with  also  domains  V,W, . . .  £  R2  (in  latitude  and  longitude). 
Also  define  (errorless)  measurement  functions. 


wem  : 

:  popj 

X 

POP2  - 

X 

,  weather  measurement  funct. 

(45) 

ssm  : 

:  popj 

X 

pop2 

Y 

,  sea  state 

meas . 

function, 

(46) 

loc  : 

:  pop3 

vuwu  •  •  • 

geolocation 

me  as . 

function  • 

(47) 

In  particular,  for  any  «  pop^  »  5  =  1,2, 

wemfZj.Zj)  =  (wemjfZj ,z2) . wemgtZj.Zj))  ,  (48) 


so  that  wenjIZj.Zj)  is  the  av.  max.  visibility  during  day  z.±  in 
region  z2  (z2  =  V  or  W)  . 

Next,  define  generalized  set  C  by,  for  all  z^.  e  pop^  , 
j  =  1.2  , 

^C(zl '  z2 )  =  <f&(^poor(wem(zl'z2))'  *rel  h.(««(*rz2)))  (49) 

turb. 


noting  that  * _  must  be  modeled  and 

poor 
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^rel  h.^x^  ~  ^hedge.  ^normal ^ x^  ' 
turb . 


all  x  e  [0,1]  , 


for  some  properly  chosen  hedge^  ,  etc. 

Define  generalized  set  D  ,  where  for  any  z2  in  pop2  < 


W  "  *or 

[over  all 
z3  ln  P°P3 


( M  z3  was  in  z2» )  , 


where 


H  z-  was  in  z,n  =  A 


(all  t  in  R  ) 


\  ^ 


Define  generalized  set  E  ,  for  any  z£  ,  Zg  e  pop3  , 


♦b(z3'z3}  *  ! 


[geo  1 
lmatch-Bj 


(wtd  dist { loc{ Zg ) , loc ( Zg ) ) )  , 


where  $  arises  from,  typically,  hypotheses  testing  of 

[geo  1 
(.match-Bj 

equality  of  means  from  gaussian  data,  and  is  thus  exponential  in 
form  (see  Chapters  8  and  9  for  further  details  relating  statistical 
procedures  with  this  modeling).  $z  ^  (loc(Zg)  ,t)  ,  typically  may 

be  obtained  as  the  probability  function  evaluation  corresponding  to 
the  output  of  a  Kalman  filter,  for  sensor  system  A. 

Next,  define  generalized  set  F  ,  where  for  all  z  €  pop1  , 

z2  «  pop2  ,  z£  ,  z£  «  popg  ,  s  «  [0,1]  -  representing  possible 

correlation  levels;  and  for  all  <t>„  :  [0,1]  [0,1]  ,  for 

Ki 

j  =  1,2,3  , 

$_(  z,  ,  z0  ,  zi  , z"  , o  ,  A  ,A„  ,A  ) 

F  1  2  3  3  Kj  Kg  Kg 

(^(1'5(fK1l'»Dlz2|l-'»K2('»s'23’25l>>''l’h=dge2«,c0rr<*'>"!  • 


Then  define  generalized  set  G  ,  by  for  all  z£,Zg  6  P°P3  * 
Zg  =  V  ,  and  for  all  $K  ,  j  «  1,2,3,  and  all  e  , 
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<l>  ( zi ,  zll ,  0  ,  ) 

Q  3  3 


fall  Zl 
Un  pop2 


,z^,z",e  ,(pK  ,  <J>K  ,<t»R  ))  . 

12  3 


(55) 


Define  generalized  set  H  by  the  conditioning  procedure  where 
for  all  z^.z'^.e  ,$K  ,  j  =  1,2,3  ,  with  z^  -  W  , 


WH'Z3'°  *%'%'%>  =  ^most(P°Pl  6  f<-Iz3'z3','\'\'^3)) 


^raost  ^  *or  !^^F(zl'M'23'zS'#^K/\,'^!^wt(zl,,))  * 
fall  2a]  123 


all  z1 
in  pop.. 


(56) 


Finally,  define  generalized  set  L  ,  corresponding  to  inference 

rule  S  (0|zi,z")  ,  indicating  the  functional  dependencies,  as,  for 

3  3 

all  Z3.Z3  «  P°P3  •  6  e  [0/1]  < 

$L(0 ,z^,z£) 


*  11 S*  ( 0  | Zg , Zg )  11  =  <!>&(<|>g(z£,z£,0  ,<}> 
V®3'®3'*'* 


/<£ 


flow  1 

fhigh  I 

LeffectJ 

LeffectJ 

,<})iden)  ' 


<♦ 


('  ))/  ♦ 


( improve) 

flow  -  1 

fhigh  -l 

LeffectJ 

LeffectJ 

(57) 


with  the  required  models  assumed  obtainable  for  the  hedges 
"improve",  "low  effect",  "high  effect",  etc. 


Example  2. 


*  * 

Consider  the  compound  sentence  S  in  the  Introduction: 

Let  all  notation  be  as  in  Example  1,  where  required: 

Without  loss  of  generality,  fix  time  interval  [a,b]  compati¬ 
ble  with  a  fixed  single  day  Z*  «  pop1  .  Let 

pop^  *  set  of  all  surface  ships  of  interest.  (57*) 

Assume  that  z 4  is  our  own  ship  and  z  is  the  target  one  with 

*  * 

z4>z4  «  pop4  •  Fix  also  region  T  «  region  around  the  Straights 

* 

of  Skagerrak,  and  let  T  «  pop2  *  Suppose  also  that 

Z  *  (range  of  possible  hul  lengths  x  ••• 

x  (range  of  possible  side-insignia  colors)  x*** 
x  (range  of  possible  descriptions-locations  of 
prominent  objects  on  ship  surf act)  x • • • 
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5  R+x* •  •  x  { .  .  .  , red .purple, .  .  . }x •  •  •  x  ( .  . . , sq. box, front , . . . , 

(oval-dome, rear) .... }x- •  •  ,  (58) 

and  (errorless)  measurement  function  Is  given 

des  :  pop4  -»  Z  ,  a  description  function,  (59) 

where  for  any  z4  in  pop4  , 


des(z4)  =  (des1(z4) , . . . ,desg(z4) , — ,des1? (z4) , . . . )  .  (60) 

Let  "foggy"  be  a  generalized  subset  of  the  range  of  the  average 
maximal  visibility  in  miles,  for  simplicity.  Define  genralized  set 
A  ,  where  for  any  times  t*  ,  t",  t*‘*  ,  and  z4  , 

4>a  ( t  *  ,t",t*  •  ■  ,z4)  «  ,  (z4).  <fs,2(t"-t')  . 

4 

fholds  contract) 
lover  [f,t"]  J 

$small(t,"'t")'  *  *  *  (loc(z  *),f“)  , 

^posterior J wemg (Zj  ,T  )€ foggy j 


^was  (**'*"' t  ‘  ‘  ‘ ))  •  (6D 

and  in  turn,  define  generalized  set  B  ,  where  for  all  z4  , 


Vz4>  =  *or  ((t>A(f,t”,t-*1,z4))  .  (62) 

(over  all  t‘,t",t,,‘,  with) 
la  s  t’  s  t"  s  t‘ • ■  s  b  J 


hext,  define  the  generalized  set  C  ,  where  for  all  z 


<t>c(z4)  =  $&(4> 


posterior 

*  .  < 
(wem.fz,  , T  )«foggy)& 

descrip3 

. 

O  A 

{ maybe (observ( des 3(z4) )=purple) ) 

*  .  (des  „(z  ))  , 

posterior  (wem  (z  ,T  )«foggy)& 
descr  i  d 

h17  (maybe(observ(des17(z4) )=( oval-dome, rear) ) 


* 

conditional 
set  of  ship 
.names 


des3(z4) 


deS17(z4) 


(des2(z4) ) 


(63) 


where  des2  is  the  naming  description  such  as  "Jones",  "S.S. 

Jackson",  etc.,  and  where  all  conditional  or  conditional  posterior 
generalized  sets  as  above  must  be  appropriately  modeled.  Then, 

finally,  define  the  generalized  set  M  ,  corresponding  to  informa- 

*  * 

tion  S  (z.)  »  for  all  zA  In  pop.  ,  as 

4  4  4 
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W  "  "S**(z4>"  =  VW'  *C(Z4,)  •  (64) 

Modeling  of  Inference  rules  such  as  given  in  Example  1  and 
error  distribution  information  as  given  in  Example  2  can  be  used  to 
extend  the  usefulness  of  combination  of  evidence  procedures.  It  is 
essentially  the  semantic  evaluation  of  the  disjunction  over  all 
nuisance  parameter  values  of  the  conjunction  of  all  relevant  infor¬ 
mation  -  in  the  case  in  Chapter  9,  being  the  conjunction  of  all 
relevant  inference  rules  connecting  matching  levels  for  attributes 
with  correlation  levels  and  error  tables  in  the  form  of  possibility 
or  membership  functions  for  the  attributes  in  posterior  forms,  given 
observed  data  [80], 


SUMMARY  AMD  COHCLPSIONS 


An  outline  has  been  presented  for  the  modeling  and  semantic 
evaluation  of  linguistic  information.  The  implementation  of  this 
depends  heavily  upon  the  appropriate  modeling  of  the  relevant  com¬ 
ponent  membership  functions  of  the  generalized  sets  involved.  (See 
[51],  pp.  255  -  264  for  approaches  to  the  latter  problem.)  Much 
work  remains  to  be  done  in  the  general  area. 
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Knowledge  representation,  utilization,  acquisition  (8.1) 
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L 


Law  of  excluded  middle  (2.2.2,  2.3.3,  5.2,  7.3,  7.4,  7.5,  10. 2A) 
Law  of  large  numbers  (8A,  10. 2. A (II)) 

Lebesgue-Stielties  logic  (LSL)  (2.3.9(B)) 

Level  sets  (5.2) 

Liar's  paradox  (10.2B) 

Lifting  operators  (Manes,  etc.)  (2.3.7,  7.2) 

Linguistic  variable  mapping  (Ling)  (2.3.8) 

Linguistic  variables  (2.3.8) 

Local  logic  (2.3.5) 

Locally  Archimedean  (2.3.6) 

Logical  connectives  (Loc)  (Preface,  1.1,  2.2.2,  2.4.2) 

Logical  constants  (2. 2) 

Logical  equivalence  (in  special  sense)  (2.4.2) 

Logical  (or  material)  implication  (2.3.3) 

Logical  axioms  (2.3.4) 

Logically  complete  (2.3.3) 

Logically  sound  (2.3.3) 

Logic ( s )  (2.2.2,  2.3.2,  2.3.3,  10. 2B) 

Logos  (2.4.2) 

Lbwenheim-Skolem  property  (2.3.3) 

Lower  probabilities  (1.2,  3.3,  4.2,  8.4) 

Lukasiewicz  logics  (Ln,  n=l,2,..,  H^)  (2.3.3,  10. 2A  (I)) 

L-valued  logics  (2.3.5) 

A -additive  ,  A -fuzzy  measures  (Sugeno)  (3.3) 


M 

Manes'  theory  of  uncertainty  ("fuzzy  theories")  (Preface,  7.2, 
10. 2B) 

Many-sorted  formal  language  (2.2.2) 

Marginal  measures  for  LSL  (2.3.9(B)) 

Marginal  random  sets  (4.2,  6) 

Markof f-Kernel  (Klement  et  al . )  (3.3) 

Matching  tables  (9  (C),  (D),  (E)) 

Material  implication  (2.3.3) 

Matheron's  theorem  (4.3,  5.3) 

Max  bounded  sum  (maxsum)  (t-conorm)  (2.3.6) 

Max(imum)  (t-conorm  operation)  (2.3.6,  8.4) 

Maximal  (or  cross)  entropy  solution  to  one  point  coverage  problem 
(1.1,  3.1,  5.4) 

Maydole-Hay  set  theory  (2.3.4) 

Measure  of  central  tendency  of  a  dispersion  (2.3.6,  2.3.9(B), 

8.4  ,  9(E),  10. 2B) 

Membership  function  (Preface,  2.3.5,  2.4.2,  6,  7.5,  9,  10) 
Membership  relation,  symbol  (2.2.2,  2.3.2,  2.4.2) 

Meta-level  of  analysis  (Prologue) 

Min  bounded  sum  (minsum)  (t-norm)  (2.3.6) 

Min(imum)  (t-norm  operation)  (2.3.6,  8.4) 

Minimal  entropy  solution  to  one  point  coverage  problem  (5.4) 
Modal  logic  (2.3.3) 

Modal  operations  (2.6,  10(App)) 

Model  (2.3.2,  2.3.3) 

Model-consistent  (2.3.3) 

Model-inconsistent  (2.3.3) 

Modeling  dispersions  (membership  functions)  (1.1,  2.3.5,  2.6, 
7.5,  9) 
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Modular  (see  Frankian) 

Modus  ponens  (logical  inference  rule)  (2.3.3) 

Modus  tollens  (logical  axiom)  (2.3.4) 

Moments  of  dispersions  (8(A),  9(E),  10. 2B) 

Monad  ( used  by  Manes )  (7.2) 

Mono (morphism)  (2.4.2) 

Monotone  measures  (3.2) 

Monotone  Choguet  capacities  (3.2) 

Morphisms  (see  arrows) 

Moyinihan's  theorem  (2.3.6,  2.9(B) (App) ) 

Multiple  point  coverage  function  (4.2,  4.3,  10. 2B) 

Multi-valued  logic  (1.2,  2.3.4) 

Multi-valued  mapping  and  integration/expectation  of  random  sets 
(4.4) 

Mutually  distributive  (2.3.6) 


H 


#• 

Naive  set  theory  (2.3.4) 

N-ary  relations  (2.6,  10(App)) 

Natural  language  and  its  modeling  (1.1,  1.2,  2.1,  2.6,  10.2A(I), 
10 (App) 

Natural  logic  (2.2.1) 

Natural  topologies  on  ordinary  membership  function  and  set  classes 
(4.2) 

Natural  transform  between  functors  (2.4.2) 

Negation  or  complement  operation  (Preface,  1.1,  2.2.2,  2.3.6) 
Negation  consistency  (2.3.3) 

Negoita-Ralescu  representation  theorem  (5.2) 

Nested  random  sets  (5.2,  6,  9(B)) 
n-copula  (See  Copula) 

n-homomorphism/extension  (Manes)  (7.2) 

Noise-free  theory  (Manes)  (7.2) 

Nondecreasing  binary  function  (2.2.2,  2.3.6,  2 . 3 . 9 (B) ( App) ) 
Non-interaction  (2.3.8,  2.3.9(B)) 

Nonstandard  analysis  (2.3.3) 

Non-truth  functional  system  (Preface,  2.3.3) 

Normable  functions  (2.3.9(B),  5.2,  7.2) 

Novak's  set  theory  (2.3.4) 

Nuisance  parmeterr-  (8.3,  9(D),(E),(F)) 


0 


Objects  (of  categories)  (2.2.2,  2.4.2) 

Observed  data  (1.1,  9) 

One-point  coverage  (filter)  class  (4.2,  5.2,  9(B)) 
One-point  coverage  problem  (1.2,  5.2) 

Open  wff  (2.2.2) 

Ordinal  sums  of  t-norms,  t-conorms  (2.3.6) 

Ordinary  set  topos  (SET,  Set)  (2.4.1,  2.4.2) 


P 


PACT  algorithm  (9) 

Parsing  principle  (2.1,  2.2.1,  2.3.4,  10(App)) 
Partial  sets  (Gilmore)  (1.2) 

Phrase  structure  (2.2.1) 
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Platonism  (2.3.1) 

Plausibility  (=  upper  probability)  measure  (3.2,  3.3,  4.2) 
Polncard  (probabilities  of  unions  of  events  in  terms  of 
intersections  and  vice-versa)  (3,  4) 

Positive  increment  monotone  (PM)  (2.3.9(B)  (App) ) 

Possibility  distribution,  measures  (3.3) 

Possibilities  (Preface,  1.1,  5.2,  9(B)) 

Posterior  generalized  (fuzzy)  set  estimators  (8.4) 

Predicate  (=  relation)  symbols,  functions  (2.2.2,  2.3.2) 
Primitive  random  set  (4.2) 

Principle  of  maximum  (cross)  entropy  (3.3) 

Probability  logic  (PL)  (2.3.3,  2.3.5,  2.3.9(A),  3.1,  7.4) 
Probabilistic  reasoning  (2.5) 

Probabilistic  system  (2.3.5) 

Prob(ability)  sum  (probsum)  (t-conorm)  (2.3.6) 

Process  (Preface) 

Prod(uct)  (t-norm  operation)  (Preface,  2.3.6) 

Products  holding  for  a  category  (2.4.2) 

Projections  (in  different  senses)  (2.3.8,  2.4.2,  2.6,  3.2,  3.3, 
4.2,  7.3,  8.3,  9(E)) 

Projective  object  (2.4.2) 

Property  systems  (Allen)  (2.3.4) 

Proposition  (Mane?)  (7.2) 

Propositional  completeness  (Manes)  (7.2) 

Prototype  (concept)  (8.1,  10.2A(I)) 

PROF  (Zadeh)  (Preface,  2.1,  2.3,  2.5,  2.6,  8,  9) 

Pseudo- complement  (2.3.3) 

Pseudo-inverse  of  a  c.  d.  f.  (2. 3. 6 (App)) 

Pseudo- topos  (=»  logos)  (2.4.2) 

Pullbacks  (2.4.2) 


Q 


Quantifications  (1.1,  2.2.2.,  2.3.3,  2.3.8,  2.3.9(B),  2.6,  10 (App)) 
Quantifications  for  LSL  (2.3.9(B)) 

"Quasi"-logical  axiom  forms: 

Quasi-universal  instantiation, 

Quasi-existential  instantiation,...  (2.3.4) 

Quotients  (2.4.2) 

Quotient  theories  (Manes)  (7.2) 


R 

Rao-Crara&r-Fisher  precision  or  information  boun-1  (10.2  B) 
Random  intervals  (5.2) 

Randomized  tests  (5.5) 

Random  membership  function  (5.1) 

Random  par  1 1 1 i oning  ( 6 ) 

Random  (sub)  set  operations  (6) 

Random  (sub)  sets  (1.2,  3.2,  3.3,  4,  5,  6,  9(B),  10.2  B) 
Range  of  a  random  set  (4.2) 

Reflexive  statements  (see  self-referencing  statements) 
Relation  symbols,  functions  (2.2.2,  2.3.2,  2.4.2) 

Robbins'  formula  (4.4,  5.5) 

Rules  for  a  theory  (2.2.2) 
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S 

Satisfaction  (2.3.3) 

Schackle's  degree  of  surprise  (1.2) 

Schefe's  agreement  probabilities  (7.5) 

Scores,  score  functions  (Lindley,  etc.)  (10.2A(III)) 
Second-level  syntax  (2.2.2) 

Second-order  probabilities  (1.2) 

Self-referencing  statements  (10.2B) 

Semantic  deduction  (2.5) 

Semantical  domain  (2.3.2) 

Semantic  evaluation  (1.2,  2.3.2,  2.3.7,  10.1,  10. 2B) 
Semi-distributive  t-norms,  t-conorms  (2.3.6,  3.2,  6) 
Semi-truth  functional  systems  (2.3.9) 

Semiotics  (2.1,  2.2.1) 

Sentences  {=  completely  bound  wff's)  (2.3) 

Seguents  ( Gent zen )  (2.2.2,  2.3.3,  2.4.2) 

Set  operations  (2.3.5) 

Set  theory  (general)  (2.2.2,  2.3.2,  2.3.4,  2.4.2) 

Sheaves  (2.4.2) 

Signature  function,  symbol  (2.2.2,  2.4.2) 

Signed  measures  (2.3.9(B)  (App) ) 

Simultaneously  weak  homomorphisms  (6) 

Sklar's  theorem  (Preface,  2.3.6) 

Sorts  (2.2.2) 

Soundness  and  completeness  (deductive)  properties  for  semantic 
evaluations  in  deduction  categories  (2.3.3) 

Space  law  (4.3) 

Span  (2.2.2) 

States  (Preface,  2.3.5) 

Steady  state  of  a  feedback-loop  system  (10.2B) 

Strict  Archimedean  (2.3.6,  2.3.8,  2.3.9(B)) 

Strings  of  wffs  (2.2.1,  2.2.2,  2.3.3,  2.6) 

Strong  bounded  variation  (SBV)  ( 2 . 3 . 9 (B) (App) ) 

Subobject  operator  (2.4.2) 

Subobject  classifier  for  a  topos  (2.4.2) 

Subconstants  (2.4.2) 

Subjective  error  tables  (9(B) , (C) , (D) , (E) ) 

Subjective  inference  rules  ( 9 (B) , (C) , (D) ,  (E)) 

Subjective  probabilities  and  possibilities  (7.3,  7.5,  8.5) 
Subset  coverage  function,  class  (4.2) 

Subset  relations  for  general  sets  (2.3.8,  2.4) 

Subset  topologies  and  a -algebras  (4. 2 (App)) 

Substitution  symbol,  function  (2.2.2,  2.3.2,  2.3.5,  2.4.2) 
Subtheory  map  (Manes)  (7.2) 

Summary  of  topics  discussed  (10.1) 

Superset  coverage  function,  class  (4.2) 

Symmetry  (2.2.2,  2.3.6) 

Syntax  (2.2.2) 

Systems  (Preface) 


T 


Tautology  (2.3.3) 

Tautological  equivalence  (2.3.3) 
t-conorm  or  disjunction  (see  disjunction) 
Temporal  operators  (2.6,  10(App)) 
Terminal  objects  (2.4.2) 

Terms  (basic  and  individual)  (2.2.2) 
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Textured  sets  (Smith)  (2.3.4) 

Theorems  (2.2.2) 

Theory  (for  a  formal  language)  (2.2.2) 

Theory  map  ( Manes )  (7.2) 

t-norm  or  conjunction  (see  conjunction) 

Topos,  topoi  (1.2,  2.3.2,  2.4,  10. 2B) 

Total  (Individual)  variable  map  (2.2.2) 

Total  variation  ( 2 . 3 . 9 (B) ( App) ) 
t-possibility  measure  (3.3) 

Tracking  and  correlation  (1.1,  9) 

Transformational  (grammar)  rules  (2.2.1) 
Transformation  of  dispersions  (2.3.8) 

Transitivity  (full,  partial)  (2.4.2) 

Truth  functions  (2.3.6) 

Truth  functional  systems  (Preface,  1.1,  2.3.3,  7.4) 
Truth  functional  completeness  (2.3.3) 

Truth-maps  (2.2.2) 

Truth  space  (2.3.3) 

Types  (2.2.2) 


0 

Oltraf liter  (see  subset  coverages) 

Uncertainty  logic  (Gaines)  (7.4) 

Uncertainty  measures  (1.2,  2.3.2,  3,  4,  5) 

Uncertainty  measures  as  coverage  functions  (4.2) 
Uncertainty  models  (Preface,  1.1,  2.3.5,  8,  9) 

Uncertainty  of  systems  (Preface) 

Union  of  generalized  sets  (2.3.4,  2.3.5,  2.4.2) 

Universal  Instantiation.  (2.3.4) 

Universal  quantification  (2.2.2,  2.3.2,  2.3.3,  2.3.4,  2.6, 
10 ( App ) ) 

Universes  of  discourse  (1.1,  2.2.2,  2.3.2) 

Upper  probabilities  (1.2,  3.3,  4.2,  8.4) 


V 


Vagueness  (Manes)  (7.2) 

Validity  (2.3.3) 

Valuation  (see  Frank) 

Von  Neumann-GOdel-Bernays  set  theory  (2.3.4) 


W 

Watanabe's  theories  of  uncertainty  (7.3,  10.2A(II)) 

Weakened  Zermelo-Fraenkel  set  theory  (zer,  sero ,  etc.)  (2.4.2) 

Weak- isomorphisms,  -homomorphisms  (5.2,  6) 

Weak  satisfaction/validity  (2.3.3) 

Weak  -completion  of  the  class  of  ordinary  membership  functions 
(2.3.6) 

Weighted-sum  operations  (6) 

Weidner's  set  theory  (2.3.4) 

Weights  of  evidence  (8.3,  8.5) 

Weights  of  importance  (1.1) 

Well-formed  formulas,  terms,  individual  variables,  (Wff,  Wfter, 
Wfv,...)  (2.2.2) 
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Well-pointed  topos  (2.4.2) 
Whorf-Sapir  hypothesis  (1.2,  2.1) 


* 

Yager's  family  of  t-norms,  t-conorms  (2.3.6) 


Z 

Zadeh's  fuzzy  set  system  (Preface,  2.3.5,  and  various  parts  of  the 
text) 

Zermelo-Fraenkel  set  theory  (ZF)  (2.3.4) 

Zero-one  stochastic  process  (4.2,  5.1,  6) 


LIST  OF  SYMBOLS 


The  following  list  of  symbols  contains  essentially  only  the 
most  important  ones . 


Symbol 


Explanation 


4 

*syn 

Th(4)  ,ThKU) 
Symb(4 ) 

H  '  fc  '  t 

Cor e(4 ) 

0b(4) 

Ar(4)  /Ara  /J(l) 

Rel(4) 

Cat (l ) 

DeCat (4  ) 

Foun(4 ) 

Ex 

Loc(4 ) 

Quan ( 4  ) 

ThsynU) 

De(Th(Syn)) 

tfar  (  l  ) 

Wfv(4) 

Wft(4) 


Formal  language 
Syntax  part  of  4  . 

Additional  theory  for  4  . 

Alphabet  or  collection  of  symbols. 

Formal  deduction  symbols. 

Category  of  types  of  things. 

Collection  of  universes  of  discourse 
Collection  of  arrows  between  universes  of 

discourse . 

Collection  of  relations  upon  universes  of 
discourse . 

Collection  of  category  theory  function 
symbols . 

Collection  of  deduction  category  theory 
function  symbols 

Collection  of  function  symbols  representing 
set-theory  concepts . 

Existence  map. 

Collection  of  logical  connectors 
Collection  of  quantifiers. 

Theory  of  syntax. 

Collection  of  deducts  or  theorems. 

Collection  of  basic  variables. 

Collection  of  individual  variables. 
Collection  of  individual  terms. 


Rel(\?ar(t ) ) 

WfatU) 

r[f] 

Wff (4) 

Wfterft ) 

WfexU ) 

FuncSymb(4)  =>  Arr(i) 
Typ { 4 )  =  Ob j ( 4 ) 

Sort (4 ) 

Q  .  (| 

Span(A;Q) 

<f  ,g> 


a 


Collection  of  relation  symbols. 

Collection  of  atomic  well-formed  formulas. 
Substitution  of  term  f  Into  relation  r  . 
Collection  of  well-formed  formulas. 
Collection  of  basic  well-formed  terms. 
Collection  of  well-formed  expressions. 
Collection  of  function  symbols  of  Core(4)  . 
Collection  of  object  classes  of  Core  (4)  . 
Collection  of  basic  types  or  kinds  of 
things. 

(Cartesian)  product  map. 

Exponentiation,  subobject  classifier. 

The  span  of  A  by  operator (s)  Q  . 

Function  product. 

Signature  map. 

(Cartesian)  sum  map. 

Equality  map. 
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List  of  Symbols 


g  x 

€ 

C 

n 

co-c 

n 

Proj,  pro j ,  proji(lxi) 


Natural  transform  (between  functors). 
Membership  map. 

Class  of  all  n-copulas. 

Class  of  all  n-co-copulas . 

,PG  Projection  map. 


O 


not  ,  nt  ,  ~ 
& 

or 

Id,  id. 
Locn(jt ) 

min  ,  a  ,  Ajj 

max  ,  v  ,  vH 


SeqA{l) .  Ser(t) 


Predsym(t ) 
Term(i ) 
Var(ArU) ) 
A x(t) 
RulU) 

SET  ,  Set 
Cons(r ) 
Form(r. ) 
Atom(i ) 
De(ThK(£) ) 


Mo  ,  So 
v  ,  3 
Indterm ( t ) 
Indvar ( L ) 
Indconst (L ) 


Th 

Th. 


I.syn 


•II 
X(l) 
Preord 


syn 


(M 

(4) 


Preord, 


Sent (L ) 

TV 

FV 


BnV 

Open(£ ) 
(c,a) 

ii  •  ii  ,  it  •  ii 

w 


S 

B 


R  _  _ 

II ,syn, 3 
w 


x 


(I) 


Composition  map. 

Truth  (only)  map. 

False  (only)  map. 

Negation  map. 

Conjunction  map. 

Disjunction  map. 

Implication  map. 

Identity  relation. 

Collection  of  n-ary  logical  connectors. 

Minimum 

Maximum 

Double  implication. 

Sequents  class. 

Collection  of  predicate  symbols. 
Collection  of  terms. 

Collection  of  basic  variables  of  Ar(£) 
Collection  of  axioms. 

Collection  of  rules. 

Category  of  ordinary  sets. 

Collection  of  basic  constants. 
Collection  of  quantified  formulas. 
Collection  of  atomic  formulas. 

Class  of  all  deducts  of  Th^ { i )  . 

Most-,  some- operators. 

Universal,  existential  operators. 
Collection  of  individual  terms. 
Collection  of  individual  variables. 
Collection  of  individual  constants. 
First-level  syntax  theory  of  t  . 

Second-level  syntax  theory  of  L  . 

Functor:  Var (l)  -*  Preord. 

Category  of  preordered  sets. 

Collection  of  forr.- .1  sentences  of  L  . 
Total  variable  map. 

Free  variable  map. 

Bound  variable  map. 

Collection  of  open  expressions  of  t  . 
Deduction  category 
Semantic  evaluation  map. 

Source  (or  domain)  map. 

Codomain  map. 

3  deduction  rule  for  level  II  syntax. 
Individual  variable  assignment  map. 
Individual  variables. 


H' 


Exponential  functor. 


List  of  Symbols 
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Sub 

*  .  (Z  *) 
Val  (t  ,  ll -il) 


Mod(Th(i) , L ) 


n 


K . 


(T  5  *) 


n 
o  , 

PL 

LSL 

GC 

Z7  , 

NST 

MH 

NGB 

ZF 

IZF 

Ch 

Wei 

Nov 

4>  - 


ax(X) 


(Z7) 


* 

4> 


P 


nt 


or 


*. 

0(5) 


or 


K) 


V$&  .  P  .  K) 


5(5  ,  p  ,  K) 


Subobject  functor. 

Tautology  equivalence. 

Logical  (or  material)  implication. 

Sequents  valid  for  n  •  ll  . 

Class  abstraction  map  (generalizing 
standard  set  notation) . 

Set  of  models  of  Th(Z)  for  logic  L  . 
Lukasiewicz  logics. 


n-iterated  implication. 

Bochvarian  logics. 

Gddel-Intuitionistic  logic. 

Krdnecker  delta  symbol. 

Probability  logic. 

Lebesgue-Stielt jes  logic. 

Axiom  of  generalized  comprehension. 

Set  theory  axiom  #  7. 

Naive  set  theory 
Maydole-Hay  set  theory. 

Von  Neumann-Gddel-Bernays  set  theory. 
Zermelo-Fraenkel  set  theory. 
Intuitionistic-Zermelo-FraenkeJ  set  theory 
Chapin's  set  theory 
Weidner's  set  theory. 

Novak's  set  theory 

Membership  map/possibility  functor. 
Membership  function  of  attribute  A  . 

Membership  function  of  not  (negation) . 

Membership  function  of  and  (t-norm). 

Membership  function  of  or  (t-conorm). 

Membership  function  of  implication. 

Class  of  all  ordinal  sum  systems  over  5  . 

t-conorm  part  of  ordinal  sum  system 

generated  by  ,  p  ,  K  . 

t-norm  part  of  ordinal  sum  system 

generated  by  <£&  ,  p  ,  K  . 

Ordinal  sum  system  generated  by  ?  ,  p  ,  K 

Membership  function  of  operation  *  . 


♦ 


wtdsum 


V  (P) 
‘’’or,  (p) 
,  s 

'’or ,  s 

% 


Membership  function  of  weighted  sum 
operator . 


Pth 

t-norm 

(Yager) 

Pth 

t-conorm  (Yager) 

3th 

t-norm 

(Frank) 

sth 

t-conorm  (Frank) 

Yager's  family  of  t-norms  and  t-conorms. 
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List  of  Symbols 


a  ,  s>e 
f  ,  re 

j»r 

♦-1(P). 


1 


(q) 

f  ,X. 
1 


» 


f ,  f ,  f ,  ?,  #<f ) ,  n(f ) 


at  the  p-level. 

Affine  operator  ordinal  sum. 

Class  of  all  DeMorgan  fuzzy  set  systems. 
Class  of  all  semi-distributive  t-norms, 
t-conorms . 

Class  of  all  Archimedean  t-norms, 
t-conorms . 

Ap,  A^  ,  the  attributes  corresponding  to 

dispersions  (or  function)  p  ,  ^  • 

The  unity  map  1  over  . 

Measure  induced  by  applying  (t-norm)  <|>&  to 

marginal  functions  f1,..,fn  . 

Extensions  of  f  . 


<x  €  A> , 


A  A  B 
A® 

AS 


B 

Sh 


(x  €  A) 


Zer 


Wff  "x  is  in  A"  or  "x  has  attribute  A" 
Incidence  between  A  and  B  (A  n  B  *  0) 

n-intensif ication  of  A  . 

n-stretching/extensif ication  of  A  . 

Initial  object. 

Terminal  object. 

Sheaves  operator 

Opposite  operator  for  g  (category  theory) . 
Weaker  ZF  set  theory. 


wpt 

wtt,  wtr,  wttr 


*  * 

«  .  <3 

pa 

Bool  (H) ,  »» 
n  n 


Well-pointed  topoi  theory. 
Well-pointed/partial  transitive  theory. 
Partial  order  on  Heyting  algebra  H  . 

Pull-back  relative  to  t  and  a  through 

g  and  f  . 

Adjolnts  of  g  . 

Terminal  arrow  map. 

Collection  of  H-valued  Boolean  functions. 


Higg(H) ,  HIGG(H)  Higg's  topos 

Fuz(H)  ,  Fuz^  ,  Gog(H),  GOG(H)  Categories  representing  fuzzy 


*(X) 

9-(X) 

*(X)  ,  ?(X) 

J  (X) 

H  ,  H 

o 

U  ,  # 

B 

q 

L<?m) 


set  theory. 

Collection  of  fuzzy  or  generalized  subsets 
of  X  . 

Set  of  dispersions  of  X  ;  natural 

X 

topology  on  (0,1) 

Power  class  of  X  *  collection  of  all 
ordinary  subsets  of  X  . 

Collection  of  all  finite  subsets  of  X  . 
Heyting  algebra  (usually). 

Lifting  operators. 

Borel  o-algebra  of  . 

Equivalence  classes  of  f  such  that  |f|q 
is  m-integrable . 


List  of  Symbols 
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,  Bool,, 
n  n 


comb(  •  ) 

C  ,  h  , 

Pr ,  p 

BV 

SBV 

PM 

ND 

g.d. f . 


Class  of  Boolean  polynomials  (classical). 

Fuzzy  set  exterior  a»n  . 

Combinatin  operator. 

Complementation  symbols. 

Probability  operators. 

Bounded  variation, 
strong  bounded  variation. 

Positive  increment  monotone. 
Non-decreasing . 

Generalized  distribution  function. 


J(x,y)  J(x  Ay  x  v  y) 

•<*  q-dif  fe.  'nee . 

Q 

Vr(f,K,d)  Variation  of  f  over  K  w.r.t.  partition  a  . 

Vr(f,K)  Total  variation  of  f  over  K 
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List  of  Symbols 


Total  variation  of 


* (Pf -Rq) 

s(^f  ,Rq) 
L(pf ,  B) 


Class  of  al  ^-measurable  subsets  of  Rq  . 

_  fj ^-measurable  functions  on  Rq  , 

_  //^-integrable  functions  over  B 


(y (c  ,a) ,Sub)  Benabou  extension  of  (c,st)  . 

Prodsum  Probability  sum  t-conorm. 

Maxsum,  maxbndsum  Maximum  sum  t-norm  :  max(0,  x  +  y  -  1) 

bndsum,  minbndsum,  minsum  Bounded  sum  t-norm  :  min(l,  x  +  y) 


V  ’rs 

Ent(S) 

Poss ,  poss . 

c  {x> 


D ( f ) ,  Diag(f) 


Prank's  family  of  t-norms  and  t-conorms . 


(1  -  ( • ) ,  te 


'  "  T&,s  '  ^or.s1  1 
Entropy  of  random  set  S  . 

Possibility  measure. 

Class  of  all  subsets  of  space  containing  x  . 


Filter  class  on 


restricted  to  sets  of 


rng ( S ) 
Lev ( A ) 


Su(A) 


y(m)(f) 


w.  1 .  o  .  g 
w.  r .  t . 


cardinality  j  . 

Null  set. 

Diagonal  part  of  f  . 
Diagonal  embedding  operator. 


Range  of  random  set  S  . 
r ^  Level  map  (a-cut  map) . 

A  random  set  one  point  equivalent  to  A  . 

T(A),  S&(A),  Sor(A) 

Special  random  sets  one  point  equivalent  to  A  . 
One  point  coverage  function  for  S  . 

One  point  coverage  relation  between  attributes  and 
random  sets. 

Random  sets  m-point  coverage  equivalent  to  f  . 
Disjoint  union. 

Truth  functional  system  (neg,  min,  max) 

Truth  functional  system  (neg,  prod,  probsum) 

Truth  functional  system  (neg,  max  sum,  bndsum) 

Truth  functional  system  (neg,  prod,  bndsum) 

is  defined  to  be. 

without  loss  of  generality. 

with  respect  to. 

Collection  of  ordinary  membership  functions  f  wi 


fUj) 


(0,1),  j 


Filter  class  of  sets  over  B  . 
Containment  class  of  sets  over 


Prod,  prod. 

er 

e 

T 

Card . 

*>  13 


incidence  classes. 

Product  t-norm. 

Intuit^onlstlc  Krdnecker  delta  for 


Deduction  of  r  froi 

Cardinality  of 
Approximate  13,  etc. 


(2.5) 


List  of  Symbols 
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SJ 

P°P  '  ^A(pop) 


Choquet  operators . 

Upper  (lower)  probability. 

A -fuzzy  (or  additive)  measure. 

c  i  *"*  j  i 

Measure  corresponding  to  random  set  S  . 

Filter  class  or  subset  coverage  function  induced  by 

random  set  S  . 

Containment  class  or  superset  coverage  function 
induced  by  random  set  S  . 

Incidence  function  induced  by  random  set  S  . 

Random  set  corresponding  to  doubt  measure  p  . 

Belief  measure. 

Joint  collection  of  random  sets. 

Zero-one  stochastic  process. 

Tranpose  of  matrix  B  . 
n-dimensional  Lebesgue  measure . 

Collection  of  generalized  sets  (using  tilde). 

Weak-homomor  phi  sm . 

Lower,  upper  probability  bounds. 

Manes  theory  (T,e,#)  . 

Loss  function  (parameter  x,  decision  r). 

Certainty  factor  for  hypothesis  H  and  evidence  E. 
Conditional  posterior  posibility  function  for 

parameter (s)  8  given  data  ^  * 

Special  membership  functions  (Ch.  8). 

Observed,  theoretical  data  vectors  corresponding  to 
attribute  A^  for  track  history  j  .  (Ch.  9). 

Error  table  for  attribute  A^  (Ch.  9). 

tth  -inference  rule  (Ch.  9)  . 

Matching  table  (used  in  antecedent  of  inference 
rule)  for  A^. 

Lindley's  transform  for  score  function  f  . 

Indicates  a  sentence. 

Population,  membership  for  population. 


